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<400> 1 

tggcgaatgg gacgcgccct gtagcggcgc attaagcgcg gcgggtgtgg tggttacgcg 60 
cagcgtgacc gctacacttg ccagcgccct agcgcccgct cctttcgctt tcttcccttc 120 
ctttctcgcc acgttcgccg gctttccccg tcaagctcta aatcgggggc tccctttagg 180 
gttccgattt agtgctttac ggcacctcga ccccaaaaaa cttgattagg gtgatggttc 240 
acgtagtggg ccatcgccct gatagacggt ttttcgccct ttgacgttgg agtccacgtt 300 
ctttaatagt ggactcttgt tccaaactgg aacaacactc aaccctatct cggtctattc 360 
ttttgattta taagggattt tgccgatttc ggcctattgg ttaaaaaatg agctgattta 420 
acaaaaattt aacgcgaatt ttaacaaaat attaacgttt acaatttcag gtggcacttt 480 
tcggggaaat gtgcgcggaa cccctatttg tttatttttc taaatacatt caaatatgta 540 
tccgctcatg agacaataac cctgataaat gcttcaataa tattgaaaaa ggaagagtat 600 
gagtattcaa catttccgtg tcgcccttat tccctttttt gcggcatttt gccttcctgt 660 
ttttgctcac ccagaaacgc tggtgaaagt aaaagatgct gaagatcagt tgggtgcacg 720 
agtgggttac atcgaactgg atctcaacag cggtaagatc cttgagagtt ttcgccccga 780 
agaacgtttt ccaatgatga gcacttttaa agttctgcta tgtggcgcgg tattatcccg 840 
tattgacgcc gggcaagagc aactcggtcg ccgcatacac tattctcaga atgacttggt 900 
tgagtactca ccagtcacag aaaagcatct tacggatggc atgacagtaa gagaattatg 960 
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cagtgctgcc ataaccatga gtgataacac 
aggaccgaag gagctaaccg cttttttgca 
tcgttgggaa ccggagctga atgaagccat 
tgcagcaatg gcaacaacgt tgcgcaaact 
ccggcaacaa ttaatagact ggatggaggc 
ggcccttccg gctggctggt ttattgctga 
cggtatcatt gcagcactgg ggccagatgg 
gacggggagt caggcaacta tggatgaacg 
actgattaag cattggtaac tgtcagacca 
aaaacttcat ttttaattta aaaggatcta 
caaaatccct taacgtgagt tttcgttcca 
aggatcttct tgagatcctt tttttctgcg 
accgctacca gcggtggttt gtttgccgga 
aactggcttc agcagagcgc agataccaaa 
ccaccacttc aagaactctg tagcaccgcc 
agtggctgct gccagtggcg ataagtcgtg 
accggataag gcgcagcggt cgggctgaac 
gcgaacgacc tacaccgaac tgagatacct 
tcccgaaggg agaaaggcgg acaggtatcc 
cacgagggag cttccagggg gaaacgcctg 
cctctgactt gagcgtcgat ttttgtgatg 
cgccagcaac gcggcctttt tacggttcct 
ctttcctgcg ttatcccctg attctgtgga 
taccgctcgc cgcagccgaa cgaccgagcg 
gcgcctgatg cggtattttc tccttacgca 
tgcactctca gtacaatctg ctctgatgcc 
cgctacgtga ctgggtcatg gctgcgcccc 
gacgggcttg tctgctcccg gcatccgctt 
gcatgtgtca gaggttttca ccgtcatcac 
catcagcgtg gtcgtgaagc gattcacaga 
tgagtttctc cagaagcgtt aatgtctggc 



tgcggccaac ttacttctga caacgatcgg 1020 
caacatgggg gatcatgtaa ctcgccttga 1080 
accaaacgac gagcgtgaca ccacgatgcc 1140 
attaactggc gaactactta ctctagcttc 1200 
ggataaagtt gcaggaccac ttctgcgctc 1260 
taaatctgga gccggtgagc gtgggtctcg 1320 
taagccctcc cgtatcgtag ttatctacac 1380 
aaatagacag atcgctgaga taggtgcctc 1440 
agtttactca tatatacttt agattgattt 1500 
ggtgaagatc ctttttgata atctcatgac 1560 
ctgagcgtca gaccccgtag aaaagatcaa 1620 
cgtaatctgc tgcttgcaaa caaaaaaacc 1680 
tcaagagcta ccaactcttt ttccgaaggt 1740 
tactgtcctt ctagtgtagc cgtagttagg 1800 
tacatacctc gctctgctaa tcctgttacc 1860 
tcttaccggg ttggactcaa gacgatagtt 1920 
ggggggttcg tgcacacagc ccagcttgga 1980 
acagcgtgag ctatgagaaa gcgccacgct 2040 
ggtaagcggc agggtcggaa caggagagcg 2100 
gtatctttat agtcctgtcg ggtttcgcca 2160 
ctcgtcaggg gggcggagcc tatggaaaaa 2220 
ggccttttgc tggccttttg ctcacatgtt 2280 
taaccgtatt accgcctttg agtgagctga 2340 
cagcgagtca gtgagcgagg aagcggaaga 2400 
tctgtgcggt atttcacacc gcatatatgg 24 60 
gcatagttaa gccagtatac actccgctat 2520 
gacacccgcc aacacccgct gacgcgccct 2580 
acagacaagc tgtgaccgtc tccgggagct 2640 
cgaaacgcgc gaggcagctg cggtaaagct 2700 
tgtctgcctg ttcatccgcg tccagctcgt 2760 
ttctgataaa gcgggccatg ttaagggcgg 2820 



3/52 



ttttttcctg tttggtcact gatgcctccg tgtaaggggg atttctgttc atgggggtaa 2880 
tgataccgat gaaacgagag aggatgctca cgatacgggt tactgatgat gaacatgccc 2940 
ggttactgga acgttgtgag ggtaaacaac tggcggtatg gatgcggcgg gaccagagaa 3000 
aaatcactca gggtcaatgc cagcgcttcg ttaatacaga tgtaggtgtt ccacagggta 3060 
gccagcagca tcctgcgatg cagatccgga acataatggt gcagggcgct gacttccgcg 3120 
tttccagact ttacgaaaca cggaaaccga agaccattca tgttgttgct caggtcgcag 3180 
acgttttgca gcagcagtcg cttcacgttc gctcgcgtat cggtgattca ttctgctaac 3240 
cagtaaggca accccgccag cctagccggg tcctcaacga caggagcacg atcatgcgca 3300 
cccgtggcca ggacccaacg ctgcccgaga tctcgatccc gcgaaattaa tacgactcac 3360 
tatagggaga ccacaacggt ttccctctag aaataatttt gtttaacttt aagaaggaga 3420 



tatacat atg ggg gac get ccc age cct gaa gag aaa ctg cac ctt ate 
Met Gly Asp Ala Pro Ser Pro Glu Glu Lys Leu His Leu He 
15 10 



tta aag gca ggg tgt gag gta aca att ctg ttt gcg gac etc cac gca 
Leu Lys Ala Gly Cys Glu Val Thr He Leu Phe Ala Asp Leu His Ala 

65 70 75 



3469 



ace egg aac ctg cag gag gtt ctg ggg gaa gag aag ctg aag gag ata 3517 
Thr Arg Asn Leu Gin Glu Val Leu Gly Glu Glu Lys Leu Lys Glu He 
15 20 25 30 

ctg aag gag egg gaa ctt aaa att tac tgg gga acg gca ace acg ggc 3565 
Leu Lys Glu Arg Glu Leu Lys He Tyr Trp Gly Thr Ala Thr Thr Gly 

35 40 45 

aaa cca cat gtg get tac ttt gtg ccc atg tea aag att gca gac ttc 3613 
Lys Pro His Val Ala Tyr Phe Val Pro Met Ser Lys He Ala Asp Phe 

50 55 60 



3661 



tac ctg gat aac atg aaa gec cca tgg gaa ctt eta gaa etc cga gtc 3709 
Tyr Leu Asp Asn Met Lys Ala Pro Trp Glu Leu Leu Glu Leu Arg Val 
80 85 90 

agt tac tat gag aat gtg ate aaa gca atg ctg gag age att ggt gtg 3757 
Ser Tyr Tyr Glu Asn Val He Lys Ala Met Leu Glu Ser He Gly Val 
95 100 105 HO 

ccc ttg gag aag etc aag ttc ate aaa ggc act gat tac cag etc age 3805 
Pro Leu Glu Lys Leu Lys Phe He Lys Gly Thr Asp Tyr Gin Leu Ser 

115 120 125 

aaa gag tac aca eta gat gtg tac aga etc tec tec gtg gtc aca cag 3853 
Lys Glu Tyr Thr Leu Asp Val Tyr Arg Leu Ser Ser Val Val Thr Gin 

130 135 140 

cac gat tec aag aag get gga get gag gtg gta aag cag gtg gag cac 3901 
His Asp Ser Lys Lys Ala Gly Ala Glu Val Val Lys Gin Val Glu His 
145 150 155 

cct ttg ctg agt ggc etc tta tac ccc gga ctg cag get ttg gat gaa 3949 
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Pro Leu Leu Ser Gly Leu Leu Tyr Pro Gly Leu Gin Ala Leu Asp Glu 
160 165 170 . 

gag tat tta aaa gta gat gcc caa ttt gga ggc att gat cag aga aag 
Glu Tvr Leu Lys Val Asp Ala Gin Phe Gly Gly He Asp Gin Arg Lys 
175 180 185 190 

att ttc acc ttt gca gag aag tac etc cct gca ctt ggc tat tea aaa 
He Phe Thr Phe Ala Glu Lys Tyr Leu Pro Ala Leu Gly Tyr Ser Lys 

195 200 205 

egg gtc cat ctg atg aat cct atg gtt cca gga tta aca ggc age aaa 
Arg Val His Leu Met Asn Pro Met Val Pro Gly Leu Thr Gly Ser Lys 

210 215 220 



gag gat gtg aag aaa aaa ctg aag aag gcc ttc tgt gag cca gga aat 
Glu Asp Val Lys Lys Lys Leu Lys Lys Ala Phe Cys Glu Pro Gly Asn 
240 245 250 

gtg gag aac aat ggg gtt ctg tec ttc ate aag cat gtc ctt ttt ccc 
Val Glu Asn Asn Gly Val Leu Ser Phe He Lys His Val Leu Phe Pro 
255 260 265 270 

ctt aag tec gag ttt gtg ate eta cga gat gag aaa tgg ggt gga aac 
Leu Lys Ser Glu Phe Val He Leu Arg Asp Glu Lys Trp Gly Gly Asn 

275 280 285 

aaa acc tac aca get tac gtg gac ctg gaa aag gac ttt get get gag 
Lys Thr Tyr Thr Ala Tyr Val Asp Leu Glu Lys Asp Phe Ala Ala Glu 

290 295 300 

gtt gta cat cct gga gac ctg aag aat tct gtt gaa gtc gca ctg aac 
Val Val His Pro Gly Asp Leu Lys Asn Ser Val Glu Val Ala Leu Asn 
305 310 315 

aag ttg ctg gat cca ate egg gaa aag ttt aat acc cct gcc ctg aaa 
Lys Leu Leu Asp Pro He Arg Glu Lys Phe Asn Thr Pro Ala Leu Lys 
320 325 330 

aaa ctg gcc age get gcc tac cca gat ccc tea aag cag aag cca atg 
Lys Leu Ala Ser Ala Ala Tyr Pro Asp Pro Ser Lys Gin Lys Pro Met 
335 340 345 350 

gcc aaa ggc cct gcc aag aat tea gaa cca gag gag gtc ate cca tec 
Ala Lys Gly Pro Ala Lys Asn Ser Glu Pro Glu Glu Val He Pro Ser 

355 360 365 



3997 



4045 



4093 



atg age tct tea gaa gag gag tec aag att gat etc ctt gat egg aag 4141 
Met Ser Ser Ser Glu Glu Glu Ser Lys He Asp Leu Leu Asp Arg Lys 
225 230 235 



cca egg act gtg gtg age ggc ctg gta cag ttc gtg ccc aag gag gaa 
Pro Arg Thr Val Val Ser Gly Leu Val Gin Phe Val Pro Lys Glu Glu 
400 405 410 



4189 



4237 



4285 



4333 



4381 



4429 



4477 



4525 



egg ctg gat ate cgt gtg ggg aaa ate ate act gtg gag aag cac cca 4573 
Arg Leu Asp He Arg Val Gly Lys He He Thr Val Glu Lys His Pro 

370 375 380 

gat gca gac age ctg tat gta gag aag att gac gtg ggg gaa get gaa 4 621 
Asp Ala Asp Ser Leu Tyr Val Glu Lys He Asp Val Gly Glu Ala Glu 
385 390 395 



4669 
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ctg cag gac agg ctg gta gtg gtg ctg tgc aac ctg aaa ccc cag aag 4717 
Leu Gin Asp Arg Leu Val Val Val Leu Cys Asn Leu Lys Pro Gin Lys 
415 420 425 430 



atg aga gga gtc gag tec caa ggc atg ctt ctg tgt get tct ata gaa 
Met Arg Gly Val Glu Ser Gin Gly Met Leu Leu Cys Ala Ser He Glu 

435 440 445 

ggg ata aac cgc cag gtt gaa cct ctg gac cct ccg gca ggc tct get 
Gly He Asn Arg Gin Val Glu Pro Leu Asp Pro Pro Ala Gly Ser Ala 

450 455 460 

cct ggt gag cac gtg ttt gtg aag ggc tat gaa aag ggc caa cca gat 
Pro Gly Glu His Val Phe Val Lys Gly Tyr Glu Lys Gly Gin Pro Asp 
465 470 475 

gag gag etc aag ccc aag aag aaa gtc ttc gag aag ttg cag get gac 
Glu Glu Leu Lys Pro Lys Lys Lys Val Phe Glu Lys Leu Gin Ala Asp 
480 485 490 

ttc aaa att tct gag gag tgc ate gca cag tgg aag caa ace aac ttc 
Phe Lys He Ser Glu Glu Cys He Ala Gin Trp Lys Gin Thr Asn Phe 
495 500 505 510 

atg ace aag ctg ggc tec att tec tgt aaa teg ctg aaa ggg ggg aac 
Met Thr Lys Leu Gly Ser He Ser Cys Lys Ser Leu Lys Gly Gly Asn 

515 520 525 



4765 



4813 



4861 



4909 



4957 



5005 



att age etc gag cac cac cac cac cac cac tgagatcegg ctgctaacaa 5055 
He Ser Leu Glu His His His His His His 

530 535 

ageccgaaag gaagctgagt tggctgctgc caccgctgag caataactag cataacccct 5115 

tggggectet aaaegggtet tgaggggttt tttgctgaaa ggaggaacta tatceggat 5174 



<210> 2 
<211> 536 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: human 
full-length TyrRS in pET20B 



<400> 2 

Met Gly Asp Ala 
1 

Asn Leu Gin Glu 

20 

Glu Arg Glu Leu 
35 

His Val Ala Tyr 
50 



Pro Ser Pro Glu 
5 

Val Leu Gly Glu 

Lys He Tyr Trp 

40 

Phe Val Pro Met 
55 



Glu Lys Leu His 
10 

Glu Lys Leu Lys 
25 

Gly Thr Ala Thr 

Ser Lys He Ala 

60 



Leu He Thr Arg 

15 

Glu He Leu Lys 
30 

Thr Gly Lys Pro 
45 

Asp Phe Leu Lys 
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Ala Gly Cys Glu Val Thr He Leu Phe Ala Asp Leu His Ala Tyr Leu 
65 70 75 80 

Asp Asn Met Lys Ala Pro Trp Glu Leu Leu Glu Leu Arg Val Ser Tyr 

85 90 95 

Tyr Glu Asn Val He Lys Ala Met Leu Glu Ser He Gly Val Pro Leu 

100 105 HO 

Glu Lys Leu Lys Phe He Lys Gly Thr Asp Tyr Gin Leu Ser Lys Glu 
115 120 125 

Tyr Thr Leu Asp Val Tyr Arg Leu Ser Ser Val Val Thr Gin His Asp 
130 135 140 

Ser Lys Lys Ala Gly Ala Glu Val Val Lys Gin Val Glu His Pro Leu 
145 150 155 160 

Leu Ser Gly Leu Leu Tyr Pro Gly Leu Gin Ala Leu Asp Glu Glu Tyr 

165 170 175 

Leu Lys Val Asp Ala Gin Phe Gly Gly He Asp Gin Arg Lys He Phe 

180 185 190 

Thr Phe Ala Glu Lys Tyr Leu Pro Ala Leu Gly Tyr Ser Lys Arg Val 
195 200 205 

His Leu Met Asn Pro Met Val Pro Gly Leu Thr Gly Ser Lys Met Ser 
210 215 220 

Ser Ser Glu Glu Glu Ser Lys He Asp Leu Leu Asp Arg Lys Glu Asp 
225 230 235 240 

Val Lys Lys Lys Leu Lys Lys Ala Phe Cys Glu Pro Gly Asn Val Glu 

245 250 255 

Asn Asn Gly Val Leu Ser Phe He Lys His Val Leu Phe Pro Leu Lys 

260 265 270 

Ser Glu Phe Val He Leu Arg Asp Glu Lys Trp Gly Gly Asn Lys Thr 
275 280 285 

Tyr Thr Ala Tyr Val Asp Leu Glu Lys Asp Phe Ala Ala Glu Val Val 
290 295 300 

His Pro Gly Asp Leu Lys Asn Ser Val Glu Val Ala Leu Asn Lys Leu 
305 310 315 320 

Leu Asp Pro He Arg Glu Lys Phe Asn Thr Pro Ala Leu Lys Lys Leu 

325 330 335 

Ala Ser Ala Ala Tyr Pro Asp Pro Ser Lys Gin Lys Pro Met Ala Lys 

340 345 350 

Gly Pro Ala Lys Asn Ser Glu Pro Glu Glu Val He Pro Ser Arg Leu 
355 360 365 

Asp He Arg Val Gly Lys He He Thr Val Glu Lys His Pro Asp Ala 
370 375 380 

Asp Ser Leu Tyr Val Glu Lys He Asp Val Gly Glu Ala Glu Pro Arg 
385 390 395 400 
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Thr Val Val Ser Gly Leu Val Gin Phe Val Pro Lys Glu Glu Leu Gin 

405 410 415 

Asp Arg Leu Val Val Val Leu Cys Asn Leu Lys Pro Gin Lys Met Arg 

420 425 430 

Gly Val Glu Ser Gin Gly Met Leu Leu Cys Ala Ser lie Glu Gly lie 
435 440 445 

Asn Arg Gin Val Glu Pro Leu Asp Pro Pro Ala Gly Ser Ala Pro Gly 
450 455 460 

Glu His Val Phe Val Lys Gly Tyr Glu Lys Gly Gin Pro Asp Glu Glu 
465 470 475 480 

Leu Lys Pro Lys Lys Lys Val Phe Glu Lys Leu Gin Ala Asp Phe Lys 

485 490 495 

He Ser Glu Glu Cys He Ala Gin Trp Lys Gin Thr Asn Phe Met Thr 

500 505 510 

Lys Leu Gly Ser He Ser Cys Lys Ser Leu Lys Gly Gly Asn He Ser 
515 520 525 

Leu Glu His His His His His His 
530 535 



<210> 3 
<211> 4682 
<212> DNA 

<213> Artificial Sequence 

<220> 
<221> CDS 

<222> (3428) . . (4543) 
<220> 

<223> Description of Artificial Sequence: human mini 
TyrRS in pET20B 

<400> 3 

tggcgaatgg gacgcgccct gtagcggcgc attaagcgcg gcgggtgtgg tggttacgcg 60 
cagcgtgacc gctacacttg ccagcgccct agcgcccgct cctttcgctt tcttcccttc 120 
ctttctcgcc acgttcgccg gctttccccg tcaagctcta aatcgggggc tccctttagg 180 
gttccgattt agtgctttac ggcacctcga ccccaaaaaa cttgattagg gtgatggttc 240 
acgtagtggg ccatcgccct gatagacggt ttttcgccct ttgacgttgg agtccacgtt 300 
ctttaatagt ggactcttgt tccaaactgg aacaacactc aaccctatct cggtctattc 360 
ttttgattta taagggattt tgccgatttc ggcctattgg ttaaaaaatg agctgattta 420 
acaaaaattt aacgcgaatt ttaacaaaat attaacgttt acaatttcag gtggcacttt 480 
tcggggaaat gtgcgcggaa cccctatttg tttatttttc taaatacatt caaatatgta 54 0 
tccgctcatg agacaataac cctgataaat gcttcaataa tattgaaaaa ggaagagtat 600 
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gagtattcaa catttccgtg tcgcccttat tccctttttt gcggcatttt gccttcctgt 660 
ttttgctcac ccagaaacgc tggtgaaagt aaaagatgct gaagatcagt tgggtgcacg 720 
agtgggttac atcgaactgg atctcaacag cggtaagatc cttgagagtt ttcgccccga 780 
agaacgtttt ccaatgatga gcacttttaa agttctgcta tgtggcgcgg tattatcccg 840 
tattgacgcc gggcaagagc aactcggtcg ccgcatacac tattctcaga atgacttggt 900 
tgagtactca ccagtcacag aaaagcatct tacggatggc atgacagtaa gagaattatg 960 
cagtgctgcc ataaccatga gtgataacac tgcggccaac ttacttctga caacgatcgg 1020 
aggaccgaag gagctaaccg cttttttgca caacatgggg gatcatgtaa ctcgccttga 1080 
tcgttgggaa ccggagctga atgaagccat accaaacgac gagcgtgaca ccacgatgcc 1140 
tgcagcaatg gcaacaacgt tgcgcaaact attaactggc gaactactta ctctagcttc 1200 
ccggcaacaa ttaatagact ggatggaggc ggataaagtt gcaggaccac ttctgcgctc 1260 
ggcccttccg gctggctggt ttattgctga taaatctgga gccggtgagc gtgggtctcg 1320 
cggtatcatt gcagcactgg ggccagatgg taagccctcc cgtatcgtag ttatctacac 1380 
gacggggagt caggcaacta tggatgaacg aaatagacag atcgctgaga taggtgcctc 1440 
actgattaag cattggtaac tgtcagacca agtttactca tatatacttt agattgattt 1500 
aaaacttcat ttttaattta aaaggatcta ggtgaagatc ctttttgata atctcatgac 1560 
caaaatccct taacgtgagt tttcgttcca ctgagcgtca gaccccgtag aaaagatcaa 1620 
aggatcttct tgagatcctt tttttctgcg cgtaatctgc tgcttgcaaa caaaaaaacc 1680 
accgctacca gcggtggttt gtttgccgga tcaagagcta ccaactcttt ttccgaaggt 1740 
aactggcttc agcagagcgc agataccaaa tactgtcctt ctagtgtagc cgtagttagg 1800 
ccaccacttc aagaactctg tagcaccgcc tacatacctc gctctgctaa tcctgttacc 1860 
agtggctgct gccagtggcg ataagtcgtg tcttaccggg ttggactcaa gacgatagtt 1920 
accggataag gcgcagcggt cgggctgaac ggggggttcg tgcacacagc ccagcttgga 1980 
gcgaacgacc tacaccgaac tgagatacct acagcgtgag ctatgagaaa gcgccacgct 204 0 
tcccgaaggg agaaaggcgg acaggtatcc ggtaagcggc agggtcggaa caggagagcg 2100 
cacgagggag cttccagggg gaaacgcctg gtatctttat agtcctgtcg ggtttcgcca 2160 
cctctgactt gagcgtcgat ttttgtgatg ctcgtcaggg gggcggagcc tatggaaaaa 2220 
cgccagcaac gcggcctttt tacggttcct ggccttttgc tggccttttg ctcacatgtt 2280 
ctttcctgcg ttatcccctg attctgtgga taaccgtatt accgcctttg agtgagctga 2340 
taccgctcgc cgcagccgaa cgaccgagcg cagcgagtca gtgagcgagg aagcggaaga 2400 
gcgcctgatg cggtattttc tccttacgca tctgtgcggt atttcacacc gcatatatgg 24 60 
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tgcactctca gtacaatctg ctctgatgcc gcatagttaa gccagtatac actccgctat 2520 

cgctacgtga ctgggtcatg gctgcgcccc gacacccgcc aacacccgct gacgcgccct 2580 

gacgggcttg tctgctcccg gcatccgctt acagacaagc tgtgaccgtc tccgggagct 2640 

gcatgtgtca gaggttttca ccgtcatcac cgaaacgcgc gaggcagctg cggtaaagct 2700 

catcagcgtg gtcgtgaagc gattcacaga tgtctgcctg ttcatccgcg tccagctcgt 27 60 

tgagtttctc cagaagcgtt aatgtctggc ttctgataaa gcgggccatg ttaagggcgg 2820 

ttttttcctg tttggtcact gatgcctccg tgtaaggggg atttctgttc atgggggtaa 2880 

tgataccgat gaaacgagag aggatgctca cgatacgggt tactgatgat gaacatgccc 2940 

ggttactgga acgttgtgag ggtaaacaac tggcggtatg gatgcggcgg gaccagagaa 3000 

aaatcactca gggtcaatgc cagcgcttcg ttaatacaga tgtaggtgtt ccacagggta 3060 

gccagcagca tcctgcgatg cagatccgga acataatggt gcagggcgct gacttccgcg 3120 

tttccagact ttacgaaaca cggaaaccga agaccattca tgttgttgct caggtcgcag 3180 

acgttttgca gcagcagtcg cttcacgttc gctcgcgtat cggtgattca ttctgctaac 3240 

cagtaaggca accccgccag cctagccggg tcctcaacga caggagcacg atcatgcgca 3300 

cccgtggcca ggacccaacg ctgcccgaga tctcgatccc gcgaaattaa tacgactcac 3360 

tatagggaga ccacaacggt ttccctctag aaataatttt gtttaacttt aagaaggaga 3420 

tatacat atg ggg gac get ccc age cct gaa gag aaa ctg cac ctt ate 3469 
Met Gly Asp Ala Pro Ser Pro Glu Glu Lys Leu His Leu He 
15 10 

acc egg aac ctg cag gag gtt ctg ggg gaa gag aag ctg aag gag ata 3517 
Thr Arg Asn Leu Gin Glu Val Leu Gly Glu Glu Lys Leu Lys Glu He 
15 20 25 30 

ctg aag gag egg gaa ctt aaa att tac tgg gga acg gca acc acg ggc 3565 
Leu Lys Glu Arg Glu Leu Lys He Tyr Trp Gly Thr Ala Thr Thr Gly 

35 40 45 

aaa cca cat gtg get tac ttt gtg ccc atg tea aag att gca gac ttc 3613 
Lys Pro His Val Ala Tyr Phe Val Pro Met Ser Lys He Ala Asp Phe 

50 55 60 

tta aag gca ggg tgt gag gta aca att ctg ttt gcg gac etc cac gca 3661 
Leu Lys Ala Gly Cys Glu Val Thr He Leu Phe Ala Asp Leu His Ala 

65 70 75 

tac ctg gat aac atg aaa gec cca tgg gaa ctt eta gaa etc cga gtc 3709 
Tyr Leu Asp Asn Met Lys Ala Pro Trp Glu Leu Leu Glu Leu Arg Val 
80 85 90 

agt tac tat gag aat gtg ate aaa gca atg ctg gag age att ggt gtg 3757 
Ser Tyr Tyr Glu Asn Val He Lys Ala Met Leu Glu Ser He Gly Val 
95 100 105 110 
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ccc ttg gag aag etc aag ttc ate aaa ggc act gat tac cag etc age 
Pro Leu Glu Lys Leu Lys Phe He Lys Gly Thr Asp Tyr Gin Leu Ser 

115 120 125 

aaa gag tac aca eta gat gtg tac aga etc tec tec gtg gtc aca cag 
Lys Glu Tyr Thr Leu Asp Val Tyr Arg Leu Ser Ser Val Val Thr Gin 

130 135 140 

cac gat tec aag aag get gga get gag gtg gta aag cag gtg gag cac 
His Asp Ser Lys Lys Ala Gly Ala Glu Val Val Lys Gin Val Glu His 
145 150 155 

cct ttg ctg agt ggc etc tta tac ccc gga ctg cag get ttg gat gaa 
Pro Leu Leu Ser Gly Leu Leu Tyr Pro Gly Leu Gin Ala Leu Asp Glu 
160 165 170 

gag tat tta aaa gta gat gee caa ttt gga ggc att gat cag aga aag 
Glu Tyr Leu Lys Val Asp Ala Gin Phe Gly Gly He Asp Gin Arg Lys 
175 180 185 190 

att ttc ace ttt gca gag aag tac etc cct gca ctt ggc tat tea aaa 
He Phe Thr Phe Ala Glu Lys Tyr Leu Pro Ala Leu Gly Tyr Ser Lys 

195 200 205 

egg gtc cat ctg atg aat cct atg gtt cca gga tta aca ggc age aaa 
Arg Val His Leu Met Asn Pro Met Val Pro Gly Leu Thr Gly Ser Lys 

210 215 220 



3805 



gag gat gtg aag aaa aaa ctg aag aag gec ttc tgt gag cca gga aat 
Glu Asp Val Lys Lys Lys Leu Lys Lys Ala Phe Cys Glu Pro Gly Asn 
240 245 250 



ctt aag tec gag ttt gtg ate eta cga gat gag aaa tgg ggt gga aac 
Leu Lys Ser Glu Phe Val He Leu Arg Asp Glu Lys Trp Gly Gly Asn 

275 280 285 



gtt gta cat cct gga gac ctg aag aat tct gtt gaa gtc gca ctg aac 
Val Val His Pro Gly Asp Leu Lys Asn Ser Val Glu Val Ala Leu Asn 
305 310 315 

aag ttg ctg gat cca ate egg gaa aag ttt aat ace cct gee ctg aaa 
Lys Leu Leu Asp Pro He Arg Glu Lys Phe Asn Thr Pro Ala Leu Lys 
320 325 330 



3853 



3901 



3949 



3997 



4045 



4093 



atg age tct tea gaa gag gag tec aag att gat etc ctt gat egg aag 4141 
Met Ser Ser Ser Glu Glu Glu Ser Lys He Asp Leu Leu Asp Arg Lys 
225 230 235 



4189 



gtg gag aac aat ggg gtt ctg tec ttc ate aag cat gtc ctt ttt ccc 4237 
Val Glu Asn Asn Gly Val Leu Ser Phe He Lys His Val Leu Phe Pro 
255 260 265 270 



4285 



aaa ace tac aca get tac gtg gac ctg gaa aag gac ttt get get gag 4333 
Lys Thr Tyr Thr Ala Tyr Val Asp Leu Glu Lys Asp Phe Ala Ala Glu 

290 295 300 



4381 



4429 



aaa ctg gee age get gee tac cca gat ccc tea aag cag aag cca atg 4 47 7 
Lys Leu Ala Ser Ala Ala Tyr Pro Asp Pro Ser Lys Gin Lys Pro Met 
335 340 345 350 
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gcc aaa ggc cct gcc aag aat tea gaa cca gag gag gtc ate etc gag 4525 
Ala Lys Gly Pro Ala Lys Asn Ser Glu Pro Glu Glu Val lie Leu Glu 

355 360 365 

cac cac cac cac cac cac tgagatcegg ctgctaacaa ageccgaaag 4573 
His His His His His His 

370 

gaagctgagt tggctgctgc caccgctgag caataactag cataacccct tggggectet 4 633 
aaaegggtet tgaggggttt tttgctgaaa ggaggaacta tatceggat 4 682 



<210> 4 
<211> 372 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: human mini 
TyrRS in pET20B 

<400> 4 

Met Gly Asp Ala Pro Ser Pro Glu Glu Lys Leu His Leu lie Thr Arg 
15 10 15 

Asn Leu Gin Glu Val Leu Gly Glu Glu Lys Leu Lys Glu lie Leu Lys 

20 25 30 

Glu Arg Glu Leu Lys lie Tyr Trp Gly Thr Ala Thr Thr Gly Lys Pro 
35 40 45 

His Val Ala Tyr Phe Val Pro Met Ser Lys lie Ala Asp Phe Leu Lys 
50 55 60 

Ala Gly Cys Glu Val Thr lie Leu Phe Ala Asp Leu His Ala Tyr Leu 
65 70 75 80 

Asp Asn Met Lys Ala Pro Trp Glu Leu Leu Glu Leu Arg Val Ser Tyr 

85 90 95 

Tyr Glu Asn Val He Lys Ala Met Leu Glu Ser- He Gly Val Pro Leu 

100 105 110 

Glu Lys Leu Lys Phe He Lys Gly Thr Asp Tyr Gin Leu Ser Lys Glu 
115 120 125 

Tyr Thr Leu Asp Val Tyr Arg Leu Ser Ser Val Val Thr Gin His Asp 
130 135 140 

Ser Lys Lys Ala Gly Ala Glu Val Val Lys Gin Val Glu His Pro Leu 
145 150 155 160 

Leu Ser Gly Leu Leu Tyr Pro Gly Leu Gin Ala Leu Asp Glu Glu Tyr 

165 170 175 

Leu Lys Val Asp Ala Gin Phe Gly Gly He Asp Gin Arg Lys He Phe 

180 185 190 



Thr Phe Ala Glu Lys Tyr Leu Pro Ala Leu Gly Tyr Ser Lys Arg Val 
195 200 205 
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His Leu Met Asn Pro Met Val Pro Gly Leu Thr Gly Ser Lys Met Ser 
210 215 220 

Ser Ser Glu Glu Glu Ser Lys lie Asp Leu Leu Asp Arg Lys Glu Asp 
225 230 235 240 

Val Lys Lys Lys Leu Lys Lys Ala Phe Cys Glu Pro Gly Asn Val Glu 

245 250 255 

Asn Asn Gly Val Leu Ser Phe lie Lys His Val Leu Phe Pro Leu Lys 

260 265 270 

Ser Glu Phe Val He Leu Arg Asp Glu Lys Trp Gly Gly Asn Lys Thr 
275 280 285 

Tyr Thr Ala Tyr Val Asp Leu Glu Lys Asp Phe Ala Ala Glu Val Val 
290 295 300 

His Pro Gly Asp Leu Lys Asn Ser Val Glu Val Ala Leu Asn Lys Leu 
305 310 315 320 

Leu Asp Pro He Arg Glu Lys Phe Asn Thr Pro Ala Leu Lys Lys Leu 

325 330 335 

Ala Ser Ala Ala Tyr Pro Asp Pro Ser Lys Gin Lys Pro Met Ala Lys 

340 345 350 

Gly Pro Ala Lys Asn Ser Glu Pro Glu Glu Val He Leu Glu His His 
355 360 365 

His His His His 
370 



<210> 5 
<211> 4100 
<212> DNA 

<213> Artificial Sequence 

<220> 
<221> CDS 

<222> (3428) . . (3961) 



<220> 

<223> Description of Artificial Sequence: human TyrRS 
carboxyl-terminal domain in pET20B 



<400> 5 
tggcgaatgg 


gacgcgccct 


gtagcggcgc 


attaagcgcg 


gcgggtgtgg 


tggttacgcg 


60 


cagcgtgacc 


gctacacttg 


ccagcgccct 


agcgcccgct 


cctttcgctt 


tcttcccttc 


120 


ctttctcgcc 


acgttcgccg 


gctttccccg 


tcaagctcta 


aatcgggggc 


tccctttagg 


180 


gttccgattt 


agtgctttac 


ggcacctcga 


ccccaaaaaa 


cttgattagg 


gtgatggttc 


240 


acgtagtggg 


ccatcgccct 


gatagacggt 


ttttcgccct 


ttgacgttgg 


agtccacgtt 


300 


ctttaatagt 


ggactcttgt 


tccaaactgg 


aacaacactc 


aaccctatct 


cggtctattc 


360 


ttttgattta 


taagggattt 


tgccgatttc 


ggcctattgg 


ttaaaaaatg 


agctgattta 


420 
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acaaaaattt aacgcgaatt ttaacaaaat attaacgttt acaatttcag gtggcacttt 480 
tcggggaaat gtgcgcggaa cccctatttg tttatttttc taaatacatt caaatatgta 540 
tccgctcatg agacaataac cctgataaat gcttcaataa tattgaaaaa ggaagagtat 600 
gagtattcaa catttccgtg tcgcccttat tccctttttt gcggcatttt gccttcctgt 660 
ttttgctcac ccagaaacgc tggtgaaagt aaaagatgct gaagatcagt tgggtgcacg 720 
agtgggttac atcgaactgg atctcaacag cggtaagatc cttgagagtt ttcgccccga 780 
agaacgtttt ccaatgatga gcacttttaa agttctgcta tgtggcgcgg tattatcccg 840 
tattgacgcc gggcaagagc aactcggtcg ccgcatacac tattctcaga atgacttggt 900 
tgagtactca ccagtcacag aaaagcatct tacggatggc atgacagtaa gagaattatg 960 
cagtgctgcc ataaccatga gtgataacac tgcggccaac ttacttctga caacgatcgg 1020 
aggaccgaag gagctaaccg cttttttgca caacatgggg gatcatgtaa ctcgccttga 1080 
tcgttgggaa ccggagctga atgaagccat accaaacgac gagcgtgaca ccacgatgcc 1140 
tgcagcaatg gcaacaacgt tgcgcaaact attaactggc gaactactta ctctagcttc 1200 
ccggcaacaa ttaatagact ggatggaggc ggataaagtt gcaggaccac ttctgcgctc 1260 
ggcccttccg gctggctggt ttattgctga taaatctgga gccggtgagc gtgggtctcg 1320 
cggtatcatt gcagcactgg ggccagatgg taagccctcc cgtatcgtag ttatctacac 1380 
gacggggagt caggcaacta tggatgaacg aaatagacag atcgctgaga taggtgcctc 1440 
actgattaag cattggtaac tgtcagacca agtttactca tatatacttt agattgattt 1500 
aaaacttcat ttttaattta aaaggatcta ggtgaagatc ctttttgata atctcatgac 1560 
caaaatccct taacgtgagt tttcgttcca ctgagcgtca gaccccgtag aaaagatcaa 1620 
aggatcttct tgagatcctt tttttctgcg cgtaatctgc tgcttgcaaa caaaaaaacc 1680 
accgctacca gcggtggttt gtttgccgga tcaagagcta ccaactcttt ttccgaaggt 1740 
aactggcttc agcagagcgc agataccaaa tactgtcctt ctagtgtagc cgtagttagg 1800 
ccaccacttc aagaactctg tagcaccgcc tacatacctc gctctgctaa tcctgttacc 18 60 
agtggctgct gccagtggcg ataagtcgtg tcttaccggg ttggactcaa gacgatagtt 1920 
accggataag gcgcagcggt cgggctgaac ggggggttcg tgcacacagc ccagcttgga 198 0 
gcgaacgacc tacaccgaac tgagatacct acagcgtgag ctatgagaaa gcgccacgct 2040 
tcccgaaggg agaaaggcgg acaggtatcc ggtaagcggc agggtcggaa caggagagcg 2100 
cacgagggag cttccagggg gaaacgcctg gtatctttat agtcctgtcg ggtttcgcca 2160 
cctctgactt gagcgtcgat ttttgtgatg ctcgtcaggg gggcggagcc tatggaaaaa 2220 
cgccagcaac gcggcctttt tacggttcct ggccttttgc tggccttttg ctcacatgtt 2280 
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ctttcctgcg ttatcccctg attctgtgga taaccgtatt accgcctttg agtgagctga 2340 

taccgctcgc cgcagccgaa cgaccgagcg cagcgagtca gtgagcgagg aagcggaaga 2400 

gcgcctgatg cggtattttc tccttacgca tctgtgcggt atttcacacc gcatatatgg 24 60 

tgcactctca gtacaatctg ctctgatgcc gcatagttaa gccagtatac actccgctat 2520 

cgctacgtga ctgggtcatg gctgcgcccc gacacccgcc aacacccgct gacgcgccct 2580 

gacgggcttg tctgctcccg gcatccgctt acagacaagc tgtgaccgtc tccgggagct 2640 

gcatgtgtca gaggttttca ccgtcatcac cgaaacgcgc gaggcagctg cggtaaagct 2700 

catcagcgtg gtcgtgaagc gattcacaga tgtctgcctg ttcatccgcg tccagctcgt 27 60 

tgagtttctc cagaagcgtt aatgtctggc ttctgataaa gcgggccatg ttaagggcgg 2820 

ttttttcctg tttggtcact gatgcctccg tgtaaggggg atttctgttc atgggggtaa 2880 

tgataccgat gaaacgagag aggatgctca cgatacgggt tactgatgat gaacatgccc 2940 

ggttactgga acgttgtgag ggtaaacaac tggcggtatg gatgcggcgg gaccagagaa 3000 

aaatcactca gggtcaatgc cagcgcttcg ttaatacaga tgtaggtgtt ccacagggta 3060 

gccagcagca tcctgcgatg cagatccgga acataatggt gcagggcgct gacttccgcg 3120 

tttccagact ttacgaaaca cggaaaccga agaccattca tgttgttgct caggtcgcag 3180 

acgttttgca gcagcagtcg cttcacgttc gctcgcgtat cggtgattca ttctgctaac 3240 

cagtaaggca accccgccag cctagccggg tcctcaacga caggagcacg atcatgcgca 3300 

cccgtggcca ggacccaacg ctgcccgaga tctcgatccc gcgaaattaa tacgactcac 3360 

tatagggaga ccacaacggt ttccctctag aaataatttt gtttaacttt aagaaggaga 3420 

tatacat atg cca gag gag gtc ate cca tec egg ctg gat ate cgt gtg 34 69 
Met Pro Glu Glu Val lie Pro Ser Arg Leu Asp lie Arg Val 
15 10 

ggg aaa ate ate act gtg gag aag cac cca gat gca gac age ctg tat 3517 
Gly Lys lie lie Thr Val Glu Lys His Pro Asp Ala Asp Ser Leu Tyr 
15 20 25 30 

gta gag aag att gac gtg ggg gaa get gaa cca egg act gtg gtg age 3565 
Val Glu Lys He Asp Val Gly Glu Ala Glu Pro Arg Thr Val Val Ser 

35 40 45 

ggc ctg gta cag ttc gtg ccc aag gag gaa ctg cag gac agg ctg gta 3613 
Gly Leu Val Gin Phe Val Pro Lys Glu Glu Leu Gin Asp Arg Leu Val 

50 55 60 

gtg gtg ctg tgc aac ctg aaa ccc cag aag atg aga gga gtc gag tec 3661 
Val Val Leu Cys Asn Leu Lys Pro Gin Lys Met Arg Gly Val Glu Ser 

65 70 75 

caa ggc atg ctt ctg tgt get tct ata gaa ggg ata aac cgc cag gtt 3709 
Gin Gly Met Leu Leu Cys Ala Ser He Glu Gly He Asn Arg Gin Val 
80 85 90 
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gaa cct ctg gac cct ccg gca ggc tct get cct ggt gag cac gtg ttt 3757 
Glu Pro Leu Asp Pro Pro Ala Gly Ser Ala Pro Gly Glu His Val Phe 
95 100 105 110 

gtg aag ggc tat gaa aag ggc caa cca gat gag gag etc aag ccc aag 3805 
Val Lys Gly Tyr Glu Lys Gly Gin Pro Asp Glu Glu Leu Lys Pro Lys 

115 120 125 

aag aaa gtc ttc gag aag ttg cag get gac ttc aaa att tct gag gag 3853 
Lys Lys Val Phe Glu Lys Leu Gin Ala Asp Phe Lys lie Ser Glu Glu 

130 135 140 

tgc ate gca cag tgg aag caa acc aac ttc atg acc aag ctg ggc tec 3901 
Cys lie Ala Gin Trp Lys Gin Thr Asn Phe Met Thr Lys Leu Gly Ser 
145 150 155 

att tec tgt aaa teg ctg aaa ggg ggg aac att age etc gag cac cac 394 9 
lie Ser Cys Lys Ser Leu Lys Gly Gly Asn lie Ser Leu Glu His His 
160 165 170 

cac cac cac cac tgagatcegg ctgctaacaa ageccgaaag gaagctgagt 4001 

His His His His 

175 

tggctgctgc caccgctgag caataactag cataacccct tggggectet aaaegggtet 4061 
tgaggggttt tttgctgaaa ggaggaacta tatceggat 4100 



<210> 6 
<211> 178 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: human TyrRS 
carboxyl- terminal domain in pET20B 

<400> 6 

Met Pro Glu Glu Val lie Pro Ser Arg Leu Asp lie Arg Val Gly Lys 
15 10 15 

He He Thr Val Glu Lys His Pro Asp Ala Asp Ser Leu Tyr Val Glu 

20 25 30 

Lys He Asp Val Gly Glu Ala Glu Pro Arg Thr Val Val Ser Gly Leu 
35 40 45 

Val Gin Phe Val Pro Lys Glu Glu Leu Gin Asp Arg Leu Val Val Val 
50 55 60 

Leu Cys Asn Leu Lys Pro Gin Lys Met Arg Gly Val Glu Ser Gin Gly 
65 70 75 80 

Met Leu Leu Cys Ala Ser He Glu Gly He Asn Arg Gin Val Glu Pro 

85 90 95 

Leu Asp Pro Pro Ala Gly Ser Ala Pro Gly Glu His Val Phe Val Lys 

100 105 110 



Gly Tyr Glu Lys Gly Gin Pro Asp Glu Glu Leu Lys Pro Lys Lys Lys 
115 120 125 
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Val Phe Glu Lys 
130 

Ala Gin Trp Lys 
145 

Cys Lys Ser Leu 



His His 



Leu Gin Ala Asp 
135 

Gin Thr Asn Phe 
150 

Lys Gly Gly Asn 
165 



Phe Lys lie Ser 

140 

Met Thr Lys Leu 
155 

lie Ser Leu Glu 
170 



Glu Glu Cys lie 



Gly Ser lie Ser 

160 

His His His His 
175 



<210> 7 
<211> 4682 
<212> DNA 

<213> Artificial Sequence 

<220> 
<221> CDS 

<222> (3428) . . (4543) 
<220> 

<223> Description of Artificial Sequence: human mini 
TyrRS mutant in pET20B 

<400> 7 

tggcgaatgg gacgcgccct gtagcggcgc attaagcgcg gcgggtgtgg tggttacgcg 60 
cagcgtgacc gctacacttg ccagcgccct agcgcccgct cctttcgctt tcttcccttc 120 
ctttctcgcc acgttcgccg gctttccccg tcaagctcta aatcgggggc tccctttagg 180 
gttccgattt agtgctttac ggcacctcga ccccaaaaaa cttgattagg gtgatggttc 240 
acgtagtggg ccatcgccct gatagacggt ttttcgccct ttgacgttgg agtccacgtt 300 
ctttaatagt ggactcttgt tccaaactgg aacaacactc aaccctatct cggtctattc 360 
ttttgattta taagggattt tgccgatttc ggcctattgg ttaaaaaatg agctgattta 420 
acaaaaattt aacgcgaatt ttaacaaaat attaacgttt acaatttcag gtggcacttt 480 
tcggggaaat gtgcgcggaa cccctatttg tttatttttc taaatacatt caaatatgta 540 
tccgctcatg agacaataac cctgataaat gcttcaataa tattgaaaaa ggaagagtat 600 
gagtattcaa catttccgtg tcgcccttat tccctttttt gcggcatttt gccttcctgt 660 
ttttgctcac ccagaaacgc tggtgaaagt aaaagatgct gaagatcagt tgggtgcacg 720 
agtgggttac atcgaactgg atctcaacag cggtaagatc cttgagagtt ttcgccccga 780 
agaacgtttt ccaatgatga gcacttttaa agttctgcta tgtggcgcgg tattatcccg 840 
tattgacgcc gggcaagagc aactcggtcg ccgcatacac tattctcaga atgacttggt 900 
tgagtactca ccagtcacag aaaagcatct tacggatggc atgacagtaa gagaattatg 960 
cagtgctgcc ataaccatga gtgataacac tgcggccaac ttacttctga caacgatcgg 1020 
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aggaccgaag gagctaaccg cttttttgca caacatgggg gatcatgtaa ctcgccttga 1080 
tcgttgggaa ccggagctga atgaagccat accaaacgac gagcgtgaca ccacgatgcc 1140 
tgcagcaatg gcaacaacgt tgcgcaaact attaactggc gaactactta ctctagcttc 1200 
ccggcaacaa ttaatagact ggatggaggc ggataaagtt gcaggaccac ttctgcgctc 1260 
ggcccttccg gctggctggt ttattgctga taaatctgga gccggtgagc gtgggtctcg 1320 
cggtatcatt gcagcactgg ggccagatgg taagccctcc cgtatcgtag ttatctacac 1380 
gacggggagt caggcaacta tggatgaacg aaatagacag atcgctgaga taggtgcctc 14 40 
actgattaag cattggtaac tgtcagacca agtttactca tatatacttt agattgattt 1500 
aaaacttcat ttttaattta aaaggatcta ggtgaagatc ctttttgata atctcatgac 1560 
caaaatccct taacgtgagt tttcgttcca ctgagcgtca gaccccgtag aaaagatcaa 1620 
aggatcttct tgagatcctt tttttctgcg cgtaatctgc tgcttgcaaa caaaaaaacc 1680 
accgctacca gcggtggttt gtttgccgga tcaagagcta ccaactcttt ttccgaaggt 1740 
aactggcttc agcagagcgc agataccaaa tactgtcctt ctagtgtagc cgtagttagg 1800 
ccaccacttc aagaactctg tagcaccgcc tacatacctc gctctgctaa tcctgttacc 1860 
agtggctgct gccagtggcg ataagtcgtg tcttaccggg ttggactcaa gacgatagtt 1920 
accggataag gcgcagcggt cgggctgaac ggggggttcg tgcacacagc ccagcttgga 1980 
gcgaacgacc tacaccgaac tgagatacct acagcgtgag ctatgagaaa gcgccacgct 2040 
tcccgaaggg agaaaggcgg acaggtatcc ggtaagcggc agggtcggaa caggagagcg 2100 
cacgagggag cttccagggg gaaacgcctg gtatctttat agtcctgtcg ggtttcgcca 2160 
cctctgactt gagcgtcgat ttttgtgatg ctcgtcaggg gggcggagcc tatggaaaaa 2220 
cgccagcaac gcggcctttt tacggttcct ggccttttgc tggccttttg ctcacatgtt 2280 
ctttcctgcg ttatcccctg attctgtgga taaccgtatt accgcctttg agtgagctga 2340 
taccgctcgc cgcagccgaa cgaccgagcg cagcgagtca gtgagcgagg aagcggaaga 2400 
gcgcctgatg cggtattttc tccttacgca tctgtgcggt atttcacacc gcatatatgg 24 60 
tgcactctca gtacaatctg ctctgatgcc gcatagttaa gccagtatac actccgctat 2520 
cgctacgtga ctgggtcatg gctgcgcccc gacacccgcc aacacccgct gacgcgccct 2580 
gacgggcttg tctgctcccg gcatccgctt acagacaagc tgtgaccgtc tccgggagct 2640 
gcatgtgtca gaggttttca ccgtcatcac cgaaacgcgc gaggcagctg cggtaaagct 2700 
catcagcgtg gtcgtgaagc gattcacaga tgtctgcctg ttcatccgcg tccagctcgt 2760 
tgagtttctc cagaagcgtt aatgtctggc ttctgataaa gcgggccatg ttaagggcgg 2820 
ttttttcctg tttggtcact gatgcctccg tgtaaggggg atttctgttc atgggggtaa 2880 
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tgataccgat 


gaaacgagag 


aggatgctca 


cgatacgggt 


tactgatgat 


gaacatgccc 


2940 


ggttactgga 


acgttgtgag 


ggtaaacaac 


tggcggtatg 


gatgcggcgg 


gaccagagaa 


3000 


aaatcactca 


gggtcaatgc 


cagcgcttcg 


ttaatacaga 


tgtaggtgtt 


ccacagggta 


3060 


gccagcagca 


tcctgcgatg 


cagatccgga 


acataatggt 


gcagggcgct 


gacttccgcg 


3120 


tttccagact 


ttacgaaaca 


cggaaaccga 


agaccattca 


tgttgttgct 


caggtcgcag 


3180 


acgttttgca 


gcagcagtcg 


cttcacgttc 


gctcgcgtat 


cggtgattca 


ttctgctaac 


3240 


cagtaaggca 


accccgccag 


cctagccggg 


tcctcaacga 


caggagcacg 


atcatgcgca 


3300 


cccgtggcca 


ggacccaacg 


ctgcccgaga 


tctcgatccc 


gcgaaattaa 


tacgactcac 


3360 


tatagggaga 


ccacaacggt 


ttccctctag 


aaataatttt 


gtttaacttt 


aagaaggaga 


3420 



tatacat atg ggg gac get ccc age cct gaa gag aaa ctg cac ctt ate 3469 
Met Gly Asp Ala Pro Ser Pro Glu Glu Lys Leu His Leu lie 
15 10 

acc egg aac ctg cag gag gtt ctg ggg gaa gag aag ctg aag gag ata 3517 
Thr Arg Asn Leu Gin Glu Val Leu Gly Glu Glu Lys Leu Lys Glu He 
15 20 25 30 

ctg aag gag egg gaa ctt aaa att tac tgg gga acg gca acc acg ggc 3565 
Leu Lys Glu Arg Glu Leu Lys He Tyr Trp Gly Thr Ala Thr Thr Gly 

35 40 45 

aaa cca cat gtg get tac ttt gtg ccc atg tea aag att gca gac ttc 3613 
Lys Pro His Val Ala Tyr Phe Val Pro Met Ser Lys He Ala Asp Phe 

50 55 60 

tta aag gca ggg tgt gag gta aca att ctg ttt gcg gac etc cac gca 3661 
Leu Lys Ala Gly Cys Glu Val Thr He Leu Phe Ala Asp Leu His Ala 
65 70 75 

tac ctg gat aac atg aaa gec cca tgg gaa ctt eta gaa ctg cag gtc 3709 
Tyr Leu Asp Asn Met Lys Ala Pro Trp Glu Leu Leu Glu Leu Gin Val 
80 85 90 

agt tac tat gag aat gtg ate aaa gca atg ctg gag age att ggt gtg 3757 
Ser Tyr Tyr Glu Asn Val He Lys Ala Met Leu Glu Ser He Gly Val 
95 100 105 110 

ccc ttg gag aag etc aag ttc ate aaa ggc act gat tac cag etc age 3805 
Pro Leu Glu Lys Leu Lys Phe He Lys Gly Thr Asp Tyr Gin Leu Ser 

115 120 125 



aaa gag tac aca eta gat gtg tac aga etc tec tec gtg gtc aca cag 
Lys Glu Tyr Thr Leu Asp Val Tyr Arg Leu Ser Ser Val Val Thr Gin 

130 135 140 



3853 



cac gat tec aag aag get gga get gag gtg gta aag cag gtg gag cac 3901 

His Asp Ser Lys Lys Ala Gly Ala Glu Val Val Lys Gin Val Glu His 
145 150 155 

cct ttg ctg agt ggc etc tta tac ccc gga ctg cag get ttg gat gaa 394 9 

Pro Leu Leu Ser Gly Leu Leu Tyr Pro Gly Leu Gin Ala Leu Asp Glu 
160 165 170 
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gag tat tta aaa gta gat gcc caa ttt gga ggc att gat cag aga aag 3997 
Glu Tyr Leu Lys Val Asp Ala Gin Phe Gly Gly He Asp Gin Arg Lys 
175 180 185 190 



att ttc acc ttt gca gag aag tac etc cct gca ctt ggc tat tea aaa 
He Phe Thr Phe Ala Glu Lys Tyr Leu Pro Ala Leu Gly Tyr Ser Lys 

195 200 205 



4045 



egg gtc cat ctg atg aat cct atg gtt cca gga tta aca ggc age aaa 4093 
Arg Val His Leu Met Asn Pro Met Val Pro Gly Leu Thr Gly Ser Lys 

210 215 220 

atg age tct tea gaa gag gag tec aag att gat etc ctt gat egg aag 4141 
Met Ser Ser Ser Glu Glu Glu Ser Lys He Asp Leu Leu Asp Arg Lys 
225 230 235 

gag gat gtg aag aaa aaa ctg aag aag gcc ttc tgt gag cca gga aat 418 9 
Glu Asp Val Lys Lys Lys Leu Lys Lys Ala Phe Cys Glu Pro Gly Asn 
240 245 250 

gtg gag aac aat ggg gtt ctg tec ttc ate aag cat gtc ctt ttt ccc 4237 
Val Glu Asn Asn Gly Val Leu Ser Phe He Lys His Val Leu Phe Pro 
255 260 265 270 

ctt aag tec gag ttt gtg ate eta cga gat gag aaa tgg ggt gga aac 4285 
Leu Lys Ser Glu Phe Val He Leu Arg Asp Glu Lys Trp Gly Gly Asn 

275 280 285 

aaa acc tac aca get tac gtg gac ctg gaa aag gac ttt get get gag 4333 
Lys Thr Tyr Thr Ala Tyr Val Asp Leu Glu Lys Asp Phe Ala Ala Glu 

290 295 300 

gtt gta cat cct gga gac ctg aag aat tct gtt gaa gtc gca ctg aac 4381 

Val Val His Pro Gly Asp Leu Lys Asn Ser Val Glu Val Ala Leu Asn 
305 310 315 

aag ttg ctg gat cca ate egg gaa aag ttt aat acc cct gcc ctg aaa 4429 
Lys Leu Leu Asp Pro He Arg Glu Lys Phe Asn Thr Pro Ala Leu Lys 
320 325 330 

aaa ctg gcc age get gcc tac cca gat ccc tea aag cag aag cca atg 4 477 
Lys Leu Ala Ser Ala Ala Tyr Pro Asp Pro Ser Lys Gin Lys Pro Met 
335 340 345 350 

gcc aaa ggc cct gcc aag aat tea gaa cca gag gag gtc ate etc gag 4525 
Ala Lys Gly Pro Ala Lys Asn Ser Glu Pro Glu Glu Val He Leu Glu 

355 360 365 

cac cac cac cac cac cac tgagatcegg ctgctaacaa ageccgaaag 4 573 

His His His His His His 

370 

gaagctgagt tggctgctgc caccgctgag caataactag cataacccct tggggectet 4 633 
aaaegggtet tgaggggttt tttgctgaaa ggaggaacta tatceggat 4 682 
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<210> 8 
<211> 372 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: human mini 
TyrRS mutant in pET20B 

<400> 8 

Met Gly Asp Ala Pro Ser Pro Glu Glu Lys Leu His Leu He Thr Arg 
15 10 15 

Asn Leu Gin Glu Val Leu Gly Glu Glu Lys Leu Lys Glu He Leu Lys 

20 25 30 

Glu Arg Glu Leu Lys He Tyr Trp Gly Thr Ala Thr Thr Gly Lys Pro 
35 40 45 

His Val Ala Tyr Phe Val Pro Met Ser Lys He Ala Asp Phe Leu Lys 
50 55 60 

Ala Gly Cys Glu Val Thr He Leu Phe Ala Asp Leu His Ala Tyr Leu 
65 70 75 80 

Asp Asn Met Lys Ala Pro Trp Glu Leu Leu Glu Leu Gin Val Ser Tyr 

85 90 95 

Tyr Glu Asn Val He Lys Ala Met Leu Glu Ser He Gly Val Pro Leu 

100 105 110 

Glu Lys Leu Lys Phe He Lys Gly Thr Asp Tyr Gin Leu Ser Lys Glu 
115 120 125 

Tyr Thr Leu Asp Val Tyr Arg Leu Ser Ser Val Val Thr Gin His Asp 
130 135 140 

Ser Lys Lys Ala Gly Ala Glu Val Val Lys Gin Val Glu His Pro Leu 
145 150 155 160 

Leu Ser Gly Leu Leu Tyr Pro Gly Leu Gin Ala Leu Asp Glu Glu Tyr 

165 170 175 

Leu Lys Val Asp Ala Gin Phe Gly Gly He Asp Gin Arg Lys He Phe 

180 185 190 

Thr Phe Ala Glu Lys Tyr Leu Pro Ala Leu Gly Tyr Ser Lys Arg Val 
195 200 205 

His Leu Met Asn Pro Met Val Pro Gly Leu Thr Gly Ser Lys Met Ser 
210 215 220 

Ser Ser Glu Glu Glu Ser Lys He Asp Leu Leu Asp Arg Lys Glu Asp 
225 230 235 240 

Val Lys Lys Lys Leu Lys Lys Ala Phe Cys Glu Pro Gly Asn Val Glu 

245 250 255 

Asn Asn Gly Val Leu Ser Phe He Lys His Val Leu Phe Pro Leu Lys 

260 265 270 
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Ser Glu Phe Val He Leu Arg Asp Glu Lys Trp Gly Gly Asn Lys Thr 
275 280 285 

Tyr Thr Ala Tyr Val Asp Leu Glu Lys Asp Phe Ala Ala Glu Val Val 
290 295 300 

His Pro Gly Asp Leu Lys Asn Ser Val Glu Val Ala Leu Asn Lys Leu 
305 310 315 320 

Leu Asp Pro He Arg Glu Lys Phe Asn Thr Pro Ala Leu Lys Lys Leu 

325 330 335 

Ala Ser Ala Ala Tyr Pro Asp Pro Ser Lys Gin Lys Pro Met Ala Lys 

340 345 350 

Gly Pro Ala Lys Asn Ser Glu Pro Glu Glu Val He Leu Glu His His 
355 360 365 

His His His His 
370 



<210> 9 
<211> 5018 
<212> DNA 

<213> Artificial Sequence 

<220> 
<221> CDS 

<222> (3428) . . (4879) 
<220> 

<223> Description of Artificial Sequence: human 
full-length TrpRS in pET20B 

<400> 9 

tggcgaatgg gacgcgccct gtagcggcgc attaagcgcg gcgggtgtgg tggttacgcg 60 
cagcgtgacc gctacacttg ccagcgccct agcgcccgct cctttcgctt tcttcccttc 120 
ctttctcgcc acgttcgccg gctttccccg tcaagctcta aatcgggggc tccctttagg 180 
gttccgattt agtgctttac ggcacctcga ccccaaaaaa cttgattagg gtgatggttc 240 
acgtagtggg ccatcgccct gatagacggt ttttcgccct ttgacgttgg agtccacgtt 300 
ctttaatagt ggactcttgt tccaaactgg aacaacactc aaccctatct cggtctattc 360 
ttttgattta taagggattt tgccgatttc ggcctattgg ttaaaaaatg agctgattta 420 
acaaaaattt aacgcgaatt ttaacaaaat attaacgttt acaatttcag gtggcacttt 480 
tcggggaaat gtgcgcggaa cccctatttg tttatttttc taaatacatt caaatatgta 540 
tccgctcatg agacaataac cctgataaat gcttcaataa tattgaaaaa ggaagagtat 600 
gagtattcaa catttccgtg tcgcccttat tccctttttt gcggcatttt gccttcctgt 660 
ttttgctcac ccagaaacgc tggtgaaagt aaaagatgct gaagatcagt tgggtgcacg 720 
agtgggttac atcgaactgg atctcaacag cggtaagatc cttgagagtt ttcgccccga 780 
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agaacgtttt ccaatgatga gcacttttaa agttctgcta tgtggcgcgg tattatcccg 840 
tattgacgcc gggcaagagc aactcggtcg ccgcatacac tattctcaga atgacttggt 900 
tgagtactca ccagtcacag aaaagcatct tacggatggc atgacagtaa gagaattatg 960 
cagtgctgcc ataaccatga gtgataacac tgcggccaac ttacttctga caacgatcgg 1020 
aggaccgaag gagctaaccg cttttttgca caacatgggg gatcatgtaa ctcgccttga 1080 
tcgttgggaa ccggagctga atgaagccat accaaacgac gagcgtgaca ccacgatgcc 1140 
tgcagcaatg gcaacaacgt tgcgcaaact attaactggc gaactactta ctctagcttc 1200 
ccggcaacaa ttaatagact ggatggaggc ggataaagtt gcaggaccac ttctgcgctc 1260 
ggcccttccg gctggctggt ttattgctga taaatctgga gccggtgagc gtgggtctcg 1320 
cggtatcatt gcagcactgg ggccagatgg taagccctcc cgtatcgtag ttatctacac 1380 
gacggggagt caggcaacta tggatgaacg aaatagacag atcgctgaga taggtgcctc 1440 
actgattaag cattggtaac tgtcagacca agtttactca tatatacttt agattgattt 1500 
aaaacttcat ttttaattta aaaggatcta ggtgaagatc ctttttgata atctcatgac 1560 
caaaatccct taacgtgagt tttcgttcca ctgagcgtca gaccccgtag aaaagatcaa 1620 
aggatcttct tgagatcctt tttttctgcg cgtaatctgc tgcttgcaaa caaaaaaacc 1680 
accgctacca gcggtggttt gtttgccgga tcaagagcta ccaactcttt ttccgaaggt 17 40 
aactggcttc agcagagcgc agataccaaa tactgtcctt ctagtgtagc cgtagttagg 1800 
ccaccacttc aagaactctg tagcaccgcc tacatacctc gctctgctaa tcctgttacc 1860 
agtggctgct gccagtggcg ataagtcgtg tcttaccggg ttggactcaa gacgatagtt 1920 
accggataag gcgcagcggt cgggctgaac ggggggttcg tgcacacagc ccagcttgga 1980 
gcgaacgacc tacaccgaac tgagatacct acagcgtgag ctatgagaaa gcgccacgct 2040 
tcccgaaggg agaaaggcgg acaggtatcc ggtaagcggc agggtcggaa caggagagcg 2100 
cacgagggag cttccagggg gaaacgcctg gtatctttat agtcctgtcg ggtttcgcca 2160 
cctctgactt gagcgtcgat ttttgtgatg ctcgtcaggg gggcggagcc tatggaaaaa 2220 
cgccagcaac gcggcctttt tacggttcct ggccttttgc tggccttttg ctcacatgtt 2280 
ctttcctgcg ttatcccctg attctgtgga taaccgtatt accgcctttg agtgagctga 2340 
taccgctcgc cgcagccgaa cgaccgagcg cagcgagtca gtgagcgagg aagcggaaga 2400 
gcgcctgatg cggtattttc tccttacgca tctgtgcggt atttcacacc gcatatatgg 2460 
tgcactctca gtacaatctg ctctgatgcc gcatagttaa gccagtatac actccgctat 2520 
cgctacgtga ctgggtcatg gctgcgcccc gacacccgcc aacacccgct gacgcgccct 2580 
gacgggcttg tctgctcccg gcatccgctt acagacaagc tgtgaccgtc tccgggagct 2640 
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gcatgtgtca gaggttttca ccgtcatcac cgaaacgcgc gaggcagctg cggtaaagct 2700 

catcagcgtg gtcgtgaagc gattcacaga tgtctgcctg ttcatccgcg tccagctcgt 27 60 

tgagtttctc cagaagcgtt aatgtctggc ttctgataaa gcgggccatg ttaagggcgg 2820 

ttttttcctg tttggtcact gatgcctccg tgtaaggggg atttctgttc atgggggtaa 2880 

tgataccgat gaaacgagag aggatgctca cgatacgggt tactgatgat gaacatgccc 2940 

ggttactgga acgttgtgag ggtaaacaac tggcggtatg gatgcggcgg gaccagagaa 3000 

aaatcactca gggtcaatgc cagcgcttcg ttaatacaga tgtaggtgtt ccacagggta 3060 

gccagcagca tcctgcgatg cagatccgga acataatggt gcagggcgct gacttccgcg 3120 

tttccagact ttacgaaaca cggaaaccga agaccattca tgttgttgct caggtcgcag 3180 

acgttttgca gcagcagtcg cttcacgttc gctcgcgtat cggtgattca ttctgctaac 3240 

cagtaaggca accccgccag cctagccggg tcctcaacga caggagcacg atcatgcgca 3300 

cccgtggcca ggacccaacg ctgcccgaga tctcgatccc gcgaaattaa tacgactcac 3360 

tatagggaga ccacaacggt ttccctctag aaataatttt gtttaacttt aagaaggaga 3420 

tatacat atg ccc aac agt gag ccc gca tct ctg ctg gag ctg ttc aac 3469 
Met Pro Asn Ser Glu Pro Ala Ser Leu Leu Glu Leu Phe Asn 
1 5 10 

age ate gec aca caa ggg gag etc gta agg tec etc aaa gcg gga aat 3517 
Ser He Ala Thr Gin Gly Glu Leu Val Arg Ser Leu Lys Ala Gly Asn 
15 20 25 30 



gcg tea aag gat gaa att gat tct gca gta aag atg ttg gtg tea tta 
Ala Ser Lys Asp Glu He Asp Ser Ala Val Lys Met Leu Val Ser Leu 

35 40 45 



3565 



aaa atg age tac aaa get gee gcg ggg gag gat tac aag get gac tgt 3613 
Lys Met Ser Tyr Lys Ala Ala Ala Gly Glu Asp Tyr Lys Ala Asp Cys 

50 55 60 

cct cca ggg aac cca gca cct ace agt aat cat ggc cca gat gee aca 3661 
Pro Pro Gly Asn Pro Ala Pro Thr Ser Asn His Gly Pro Asp Ala Thr 
65 70 75 

gaa get gaa gag gat ttt gtg gac cca tgg aca gta cag aca age agt 3709 
Glu Ala Glu Glu Asp Phe Val Asp Pro Trp Thr Val Gin Thr Ser Ser 
80 85 90 

gca aaa ggc ata gac tac gat aag etc att gtt egg ttt gga agt agt 3757 
Ala Lys Gly He Asp Tyr Asp Lys Leu He Val Arg Phe Gly Ser Ser 

^ r- irtr Tin 



95 100 105 110 

aaa att gac aaa gag eta ata aac cga ata gag aga gee ace ggc caa 
Lys He Asp Lys Glu Leu He Asn Arg He Glu Arg Ala Thr Gly Gin 

115 120 125 



3805 



aga cca cac cac ttc ctg cgc aga ggc ate ttc ttc tea cac aga gat 3853 
Arg Pro His His Phe Leu Arg Arg Gly He Phe Phe Ser His Arg Asp 

130 135 140 
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atg aat cag gtt ctt gat gcc tat gaa aat aag aag cca ttt tat ctg 
Met Asn Gin Val Leu Asp Ala Tyr Glu Asn Lys Lys Pro Phe Tyr Leu 
145 150 155 

tac acg ggc egg ggc ccc tct tct gaa gca atg cat gta ggt cac etc 
Tyr Thr Gly Arg Gly Pro Ser Ser Glu Ala Met His Val Gly His Leu 
160 165 170 

att cca ttt att ttc aca aag tgg etc cag gat gta ttt aac gtg ccc 
He Pro Phe He Phe Thr Lys Trp Leu Gin Asp Val Phe Asn Val Pro 
175 180 185 190 

ttg gtc ate cag atg acg gat gac gag aag tat ctg tgg aag gac ctg 
Leu Val He Gin Met Thr Asp Asp Glu Lys Tyr Leu Trp Lys Asp Leu 

195 200 205 

acc ctg gac cag gcc tat ggc gat get gtt gag aat gcc aag gac ate 
Thr Leu Asp Gin Ala Tyr Gly Asp Ala Val Glu Asn Ala Lys Asp He 

210 215 220 



3901 



gac tac atg ggg atg age tea ggt ttc tac aaa aat gtg gtg aag att 
Asp Tyr Met Gly Met Ser Ser Gly Phe Tyr Lys Asn Val Val Lys He 
240 245 250 



gac age gac tgc att ggg aag ate agt ttt cct gcc ate cag get get 
Asp Ser Asp Cys He Gly Lys He Ser Phe Pro Ala He Gin Ala Ala 

275 280 285 



ate cag tgc ctt ate cca tgt gcc att gac cag gat cct tac ttt aga 
He Gin Cys Leu He Pro Cys Ala He Asp Gin Asp Pro Tyr Phe Arg 
305 310 315 



ttg cac tec acc ttc ttc cca gcc ctg cag ggc gcc cag acc aaa atg 
Leu His Ser Thr Phe Phe Pro Ala Leu Gin Gly Ala Gin Thr Lys Met 
335 340 345 350 

agt gcc age gac cca aac tec tec ate ttc etc acc gac acg gcc aag 
Ser Ala Ser Asp Pro Asn Ser Ser He Phe Leu Thr Asp Thr Ala Lys 

355 360 365 



3949 



3997 



4045 



4093 



ate gcc tgt ggc ttt gac ate aac aag act ttc ata ttc tct gac ctg 4141 
He Ala Cys Gly Phe Asp He Asn Lys Thr Phe He Phe Ser Asp Leu 
225 230 235 



4189 



caa aag cat gtt acc ttc aac caa gtg aaa ggc att ttc ggc ttc act 4237 
Gin Lys His Val Thr Phe Asn Gin Val Lys Gly He Phe Gly Phe Thr 
255 260 265 270 



4285 



ccc tec ttc age aac tea ttc cca cag ate ttc cga gac agg acg gat 4333 
Pro Ser Phe Ser Asn Ser Phe Pro Gin He Phe Arg Asp Arg Thr Asp 

290 295 300 



4381 



atg aca agg gac gtc gcc ccc agg ate ggc tat cct aaa cca gcc ctg 4429 
Met Thr Arg Asp Val Ala Pro Arg He Gly Tyr Pro Lys Pro Ala Leu 
320 325 330 



4477 



4525 



cag ate aaa acc aag gtc aat aag cat gcg ttt tct gga ggg aga gac 4573 
Gin He Lys Thr Lys Val Asn Lys His Ala Phe Ser Gly Gly Arg Asp 

370 375 380 
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acc ate gag gag cac agg cag ttt ggg ggc aac tgt gat gtg gac gtg 
Thr He Glu Glu His Arg Gin Phe Gly Gly Asn Cys Asp Val Asp Val 
385 390 395 

tct ttc atg tac ctg acc ttc ttc etc gag gac gac gac aag etc gag 
Ser Phe Met Tyr Leu Thr Phe Phe Leu Glu Asp Asp Asp Lys Leu Glu 
400 405 410 



4621 



aag aag gca etc ata gag gtt ctg cag ccc ttg ate gca gag cac cag 
Lys Lys Ala Leu He Glu Val Leu Gin Pro Leu He Ala Glu His Gin 

435 440 445 



ccc egg aag ctg tec ttc gac ttt cag aag ctt gcg gee gca etc gag 
Pro Arg Lys Leu Ser Phe Asp Phe Gin Lys Leu Ala Ala Ala Leu Glu 
465 470 475 

cac cac cac cac cac cac tgagatcegg ctgctaacaa ageccgaaag 
His His His His His His 
480 



4669 



cag ate agg aag gat tac acc age gga gee atg etc acc ggt gag etc 4717 
Gin He Arg Lys Asp Tyr Thr Ser Gly Ala Met Leu Thr Gly Glu Leu 
415 420 425 430 



4765 



gee egg cgc aag gag gtc acg gat gag ata gtg aaa gag ttc atg act 4813 
Ala Arg Arg Lys Glu Val Thr Asp Glu He Val Lys Glu Phe Met Thr 

450 455 460 



4861 



4909 



gaagctgagt tggctgctgc caccgctgag caataactag cataacccct tggggectet 4 969 
aaaegggtet tgaggggttt tttgctgaaa ggaggaacta tatceggat 5018 



<210> 10 
<211> 484 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: human 
full-length TrpRS in pET20B 

<400> 10 

Met Pro Asn Ser Glu Pro Ala Ser Leu Leu Glu Leu Phe Asn Ser He 
15 10 15 

Ala Thr Gin Gly Glu Leu Val Arg Ser Leu Lys Ala Gly Asn Ala Ser 

20 25 30 

Lys Asp Glu He Asp Ser Ala Val Lys Met Leu Val Ser Leu Lys Met 
35 40 45 

Ser Tyr Lys Ala Ala Ala Gly Glu Asp Tyr Lys Ala Asp Cys Pro Pro 
50 55 60 

Gly Asn Pro Ala Pro Thr Ser Asn His Gly Pro Asp Ala Thr Glu Ala 
65 70 75 80 

Glu Glu Asp Phe Val Asp Pro Trp Thr Val Gin Thr Ser Ser Ala Lys 

85 90 95 
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Gly He Asp Tyr Asp Lys Leu He Val Arg Phe Gly Ser Ser Lys He 

100 105 HO 

Asp Lys Glu Leu He Asn Arg He Glu Arg Ala Thr Gly Gin Arg Pro 
115 120 125 

His His Phe Leu Arg Arg Gly He Phe Phe Ser His Arg Asp Met Asn 
130 135 140 

Gin Val Leu Asp Ala Tyr Glu Asn Lys Lys Pro Phe Tyr Leu Tyr Thr 
145 150 155 160 

Gly Arg Gly Pro Ser Ser Glu Ala Met His Val Gly His Leu He Pro 

165 170 175 

Phe He Phe Thr Lys Trp Leu Gin Asp Val Phe Asn Val Pro Leu Val 

180 185 190 

He Gin Met Thr Asp Asp Glu Lys Tyr Leu Trp Lys Asp Leu Thr Leu 
195 200 205 

Asp Gin Ala Tyr Gly Asp Ala Val Glu Asn Ala Lys Asp He He Ala 
210 215 220 

Cys Gly Phe Asp He Asn Lys Thr Phe He Phe Ser Asp Leu Asp Tyr 
225 230 235 240 

Met Gly Met Ser Ser Gly Phe Tyr Lys Asn Val Val Lys He Gin Lys 

245 250 255 

His Val Thr Phe Asn Gin Val Lys Gly He Phe Gly Phe Thr Asp Ser 

260 265 270 

Asp Cys lie Gly Lys He Ser Phe Pro Ala He Gin Ala Ala Pro Ser 
275 280 285 

Phe Ser Asn Ser Phe Pro Gin He Phe Arg Asp Arg Thr Asp He Gin 
290 295 300 

Cys Leu He Pro Cys Ala He Asp Gin Asp Pro Tyr Phe Arg Met Thr 
305 310 315 320 

Arg Asp Val Ala Pro Arg He Gly Tyr Pro Lys Pro Ala Leu Leu His 

325 330 335 

Ser Thr Phe Phe Pro Ala Leu Gin Gly Ala Gin Thr Lys Met Ser Ala 

340 345 350 

Ser Asp Pro Asn Ser Ser He Phe Leu Thr Asp Thr Ala Lys Gin He 
355 360 365 

Lys Thr Lys Val Asn Lys His Ala Phe Ser Gly Gly Arg Asp Thr He 
370 375 380 

Glu Glu His Arg Gin Phe Gly Gly Asn Cys Asp Val Asp Val Ser Phe 
385 390 395 400 

Met Tyr Leu Thr Phe Phe Leu Glu Asp Asp Asp Lys Leu Glu Gin He 

405 410 415 

Arg Lys Asp Tyr Thr Ser Gly Ala Met Leu Thr Gly Glu Leu Lys Lys 

420 425 430 
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Ala Leu He Glu Val Leu Gin Pro Leu He Ala Glu His Gin Ala Arg 
435 440 445 

Arg Lys Glu Val Thr Asp Glu He Val Lys Glu Phe Met Thr Pro Arg 
450 455 460 

Lys Leu Ser Phe Asp Phe Gin Lys Leu Ala Ala Ala Leu Glu His His 
465 470 475 480 

His His His His 



<210> 11 
<211> 4877 
<212> DNA 

<213> Artificial Sequence 

<220> 
<221> CDS 

<222> (3428) . . (4738) 
<220> 

<223> Description of Artificial Sequence: human mini 
TrpRS in pET20B 

<400> 11 

tggcgaatgg gacgcgccct gtagcggcgc attaagcgcg gcgggtgtgg tggttacgcg 60 
cagcgtgacc gctacacttg ccagcgccct agcgcccgct cctttcgctt tcttcccttc 120 
ctttctcgcc acgttcgccg gctttccccg tcaagctcta aatcgggggc tccctttagg 180 
gttccgattt agtgctttac ggcacctcga ccccaaaaaa cttgattagg gtgatggttc 240 
acgtagtggg ccatcgccct gatagacggt ttttcgccct ttgacgttgg agtccacgtt 300 
ctttaatagt ggactcttgt tccaaactgg aacaacactc aaccctatct cggtctattc 360 
ttttgattta taagggattt tgccgatttc ggcctattgg ttaaaaaatg agctgattta 420 
acaaaaattt aacgcgaatt ttaacaaaat attaacgttt acaatttcag gtggcacttt 480 
tcggggaaat gtgcgcggaa cccctatttg tttatttttc taaatacatt caaatatgta 540 
tccgctcatg agacaataac cctgataaat gcttcaataa tattgaaaaa ggaagagtat 600 
gagtattcaa catttccgtg tcgcccttat tccctttttt gcggcatttt gccttcctgt 660 
ttttgctcac ccagaaacgc tggtgaaagt aaaagatgct gaagatcagt tgggtgcacg 720 
agtgggttac atcgaactgg atctcaacag cggtaagatc cttgagagtt ttcgccccga 780 
agaacgtttt ccaatgatga gcacttttaa agttctgcta tgtggcgcgg tattatcccg 840 
tattgacgcc gggcaagagc aactcggtcg ccgcatacac tattctcaga atgacttggt 900 
tgagtactca ccagtcacag aaaagcatct tacggatggc atgacagtaa gagaattatg 960 
cagtgctgcc ataaccatga gtgataacac tgcggccaac ttacttctga caacgatcgg 1020 
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aggaccgaag gagctaaccg cttttttgca caacatgggg gatcatgtaa ctcgccttga 1080 
tcgttgggaa ccggagctga atgaagccat accaaacgac gagcgtgaca ccacgatgcc 1140 
tgcagcaatg gcaacaacgt tgcgcaaact attaactggc gaactactta ctctagcttc 1200 
ccggcaacaa ttaatagact ggatggaggc ggataaagtt gcaggaccac ttctgcgctc 1260 
ggcccttccg gctggctggt ttattgctga taaatctgga gccggtgagc gtgggtctcg 1320 
cggtatcatt gcagcactgg ggccagatgg taagccctcc cgtatcgtag ttatctacac 1380 
gacggggagt caggcaacta tggatgaacg aaatagacag atcgctgaga taggtgcctc 1440 
actgattaag cattggtaac tgtcagacca agtttactca tatatacttt agattgattt 1500 
aaaacttcat ttttaattta aaaggatcta ggtgaagatc ctttttgata atctcatgac 1560 
caaaatccct taacgtgagt tttcgttcca ctgagcgtca gaccccgtag aaaagatcaa 1620 
aggatcttct tgagatcctt tttttctgcg cgtaatctgc tgcttgcaaa caaaaaaacc 1680 
accgctacca gcggtggttt gtttgccgga tcaagagcta ccaactcttt ttccgaaggt 1740 
aactggcttc agcagagcgc agataccaaa tactgtcctt ctagtgtagc cgtagttagg 1800 
ccaccacttc aagaactctg tagcaccgcc tacatacctc gctctgctaa tcctgttacc 1860 
agtggctgct gccagtggcg ataagtcgtg tcttaccggg ttggactcaa gacgatagtt 1920 
accggataag gcgcagcggt cgggctgaac ggggggttcg tgcacacagc ccagcttgga 1980 
gcgaacgacc tacaccgaac tgagatacct acagcgtgag ctatgagaaa gcgccacgct 2040 
tcccgaaggg agaaaggcgg acaggtatcc ggtaagcggc agggtcggaa caggagagcg 2100 
cacgagggag cttccagggg gaaacgcctg gtatctttat agtcctgtcg ggtttcgcca 2160 
cctctgactt gagcgtcgat ttttgtgatg ctcgtcaggg gggcggagcc tatggaaaaa 2220 
cgccagcaac gcggcctttt tacggttcct ggccttttgc tggccttttg ctcacatgtt 2280 
ctttcctgcg ttatcccctg attctgtgga taaccgtatt accgcctttg agtgagctga 2340 
taccgctcgc cgcagccgaa cgaccgagcg cagcgagtca gtgagcgagg aagcggaaga 2400 
gcgcctgatg cggtattttc tccttacgca tctgtgcggt atttcacacc gcatatatgg 24 60 
tgcactctca gtacaatctg ctctgatgcc gcatagttaa gccagtatac actccgctat 2520 
cgctacgtga ctgggtcatg gctgcgcccc gacacccgcc aacacccgct gacgcgccct 2580 
gacgggcttg tctgctcccg gcatccgctt acagacaagc tgtgaccgtc tccgggagct 2640 
gcatgtgtca gaggttttca ccgtcatcac cgaaacgcgc gaggcagctg cggtaaagct 2700 
catcagcgtg gtcgtgaagc gattcacaga tgtctgcctg ttcatccgcg tccagctcgt 2760 
tgagtttctc cagaagcgtt aatgtctggc ttctgataaa gcgggccatg ttaagggcgg 2820 
ttttttcctg tttggtcact gatgcctccg tgtaaggggg atttctgttc atgggggtaa 2880 
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tgataccgat 


gaaacgagag 


aggatgetea 


egataegggt 


tactgatgat 


gaacatgccc 


2940 


ggttactgga 


acgttgtgag 


ggtaaacaac 


tggcggtatg 


gatgeggegg 


gaccagagaa 


3000 


aaatcactca 


gggtcaatgc 


cagcgcttcg 


ttaatacaga 


tgtaggtgtt 


ccacagggta 


3060 


gccagcagca 


tectgegatg 


cagatcegga 


acataatggt 


geagggeget 


gacttccgcg 


3120 


tttccagact 


ttacgaaaca 


eggaaacega 


agaccattca 


tgttgttgct 


caggtegcag 


3180 


acgttttgca 


gcagcagtcg 


cttcacgttc 


getcgegtat 


eggtgattea 


ttctgetaac 


3240 


cagtaaggca 


accccgccag 


cctagccggg 


tcctcaacga 


caggagcacg 


ateatgegea 


3300 


cccgtggcca 


ggacccaacg 


ctgcccgaga 


tctcgatccc 


gcgaaattaa 


tacgactcac 


3360 


tatagggaga 


ccacaacggt 


ttccctctag 


aaataatttt 


gtttaacttt 


aagaaggaga 


3420 


tatacat atg age tac aaa get gec < 


gcg ggg gag 


gat tac aag get gac 


3469 



Met Ser Tyr Lys Ala Ala Ala Gly Glu Asp Tyr Lys Ala Asp 
15 10 

tgt cct cca ggg aac cca gca cct acc agt aat cat ggc cca gat gec 3517 
Cys Pro Pro Gly Asn Pro Ala Pro Thr Ser Asn His Gly Pro Asp Ala 
15 20 25 30 

aca gaa get gaa gag gat ttt gtg gac cca tgg aca gta cag aca age 3565 

Thr Glu Ala Glu Glu Asp Phe Val Asp Pro Trp Thr Val Gin Thr Ser 

35 40 45 

agt gca aaa ggc ata gac tac gat aag etc att gtt egg ttt gga agt 3613 
Ser Ala Lys Gly lie Asp Tyr Asp Lys Leu lie Val Arg Phe Gly Ser 

50 55 60 

agt aaa att gac aaa gag eta ata aac cga ata gag aga gee acc ggc 3661 
Ser Lys lie Asp Lys Glu Leu lie Asn Arg lie Glu Arg Ala Thr Gly 

65 70 75 

caa aga cca cac cac ttc ctg cgc aga ggc ate ttc ttc tea cac aga 3709 
Gin Arg Pro His His Phe Leu Arg Arg Gly He Phe Phe Ser His Arg 
80 85 90 

gat atg aat cag gtt ctt gat gec tat gaa aat aag aag cca ttt tat 3757 
Asp Met Asn Gin Val Leu Asp Ala Tyr Glu Asn Lys Lys Pro Phe Tyr 
95 100 105 110 

ctg tac acg ggc egg ggc ccc tct tct gaa gca atg cat gta ggt cac 3805 
Leu Tyr Thr Gly Arg Gly Pro Ser Ser Glu Ala Met His Val Gly His 

115 120 125 

etc att cca ttt att ttc aca aag tgg etc cag gat gta ttt aac gtg 3853 
Leu lie Pro Phe He Phe Thr Lys Trp Leu Gin Asp Val Phe Asn Val 

130 135 140 

ccc ttg gtc ate cag atg acg gat gac gag aag tat ctg tgg aag gac 3901 
Pro Leu Val He Gin Met Thr Asp Asp Glu Lys Tyr Leu Trp Lys Asp 
145 150 155 

ctg acc ctg gac cag gec tat ggc gat get gtt gag aat gec aag gac 394 9 
Leu Thr Leu Asp Gin Ala Tyr Gly Asp Ala Val Glu Asn Ala Lys Asp 
160 165 170 
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ate ate gec tgt ggc ttt gac ate aac aag act ttc ata ttc tct gac 3997 
He He Ala Cys Gly Phe Asp He Asn Lys Thr Phe He Phe Ser Asp 
175 180 185 190 



ctg gac tac atg ggg atg age tea ggt ttc tac aaa aat gtg gtg aag 
Leu Asp Tyr Met Gly Met Ser Ser Gly Phe Tyr Lys Asn Val Val Lys 

195 200 205 



4045 



att caa aag cat gtt acc ttc aac caa gtg aaa ggc att ttc ggc ttc 4093 
He Gin Lys His Val Thr Phe Asn Gin Val Lys Gly He Phe Gly Phe 

210 215 220 

act gac age gac tgc att ggg aag ate agt ttt cct gee ate cag get 4141 
Thr Asp Ser Asp Cys He Gly Lys He Ser Phe Pro Ala He Gin Ala 
225 230 235 

get ccc tec ttc age aac tea ttc cca cag ate ttc cga gac agg acg 4189 
Ala Pro Ser Phe Ser Asn Ser Phe Pro Gin He Phe Arg Asp Arg Thr 
240 245 250 

gat ate cag tgc ctt ate cca tgt gee att gac cag gat cct tac ttt 4237 
Asp He Gin Cys Leu He Pro Cys Ala He Asp Gin Asp Pro Tyr Phe 
255 260 265 270 

aga atg aca agg gac gtc gee ccc agg ate ggc tat cct aaa cca gec 4285 
Arg Met Thr Arg Asp Val Ala Pro Arg He Gly Tyr Pro Lys Pro Ala 

275 280 285 

ctg ttg cac tec acc ttc ttc cca gee ctg cag ggc gee cag acc aaa 4333 
Leu Leu His Ser Thr Phe Phe Pro Ala Leu Gin Gly Ala Gin Thr Lys 

290 295 300 

atg agt gee age gac cca aac tec tec ate ttc etc acc gac acg gee 4381 
Met Ser Ala Ser Asp Pro Asn Ser Ser He Phe Leu Thr Asp Thr Ala 
305 310 315 

aag cag ate aaa acc aag gtc aat aag cat gcg ttt tct gga ggg aga 4 429 
Lys Gin He Lys Thr Lys Val Asn Lys His Ala Phe Ser Gly Gly Arg 
320 325 330 

gac acc ate gag gag cac agg cag ttt ggg ggc aac tgt gat gtg gac 4477 
Asp Thr He Glu Glu His Arg Gin Phe Gly Gly Asn Cys Asp Val Asp 
335 340 345 350 

gtg tct ttc atg tac ctg acc ttc ttc etc gag gac gac gac aag etc 4 525 
Val Ser Phe Met Tyr Leu Thr Phe Phe Leu Glu Asp Asp Asp Lys Leu 

355 360 365 

gag cag ate agg aag gat tac acc age gga gec atg etc acc ggt gag 4573 
Glu Gin He Arg Lys Asp Tyr Thr Ser Gly Ala Met Leu Thr Gly Glu 

370 375 380 

etc aag aag gca etc ata gag gtt ctg cag ccc ttg ate gca gag cac 4 621 
Leu Lys Lys Ala Leu He Glu Val Leu Gin Pro Leu He Ala Glu His 
385 390 395 

cag gee egg cgc aag gag gtc acg gat gag ata gtg aaa gag ttc atg 4 669 
Gin Ala Arg Arg Lys Glu Val Thr Asp Glu He Val Lys Glu Phe Met 
400 405 410 
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act ccc egg aag ctg tec ttc gac ttt cag aag ctt gcg gec gca etc 4717 
Thr Pro Arg Lys Leu Ser Phe Asp Phe Gin Lys Leu Ala Ala Ala Leu 
415 420 425 430 

gag cac cac cac cac cac cac tgagatcegg ctgctaacaa ageccgaaag 47 68 

Glu His His His His His His 

435 

gaagctgagt tggctgctgc caccgctgag caataactag cataacccct tggggectet 4828 
aaaegggtet tgaggggttt tttgctgaaa ggaggaacta tatceggat 4877 



<210> 12 

<211> 437 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: human mini 
TrpRS in pET20B 

<400> 12 

Met Ser Tyr Lys Ala Ala Ala Gly Glu Asp Tyr Lys Ala Asp Cys Pro 
15 10 15 

Pro Gly Asn Pro Ala Pro Thr Ser Asn His Gly Pro Asp Ala Thr Glu 

20 25 30 

Ala Glu Glu Asp Phe Val Asp Pro Trp Thr Val Gin Thr Ser Ser Ala 

35 40 45 

Lys Gly lie Asp Tyr Asp Lys Leu He Val Arg Phe Gly Ser Ser Lys 
50 55 60 

He Asp Lys Glu Leu He Asn Arg He Glu Arg Ala Thr Gly Gin Arg 
65 70 75 80 

Pro His His Phe Leu Arg Arg Gly He Phe Phe Ser His Arg Asp Met 

85 90 " 95 

Asn Gin Val Leu Asp Ala Tyr Glu Asn Lys Lys Pro Phe Tyr Leu Tyr 

100 105 110 

Thr Gly Arg Gly Pro Ser Ser Glu Ala Met His Val Gly His Leu He 
115 120 125 

Pro Phe He Phe Thr Lys Trp Leu Gin Asp Val Phe Asn Val Pro Leu 
130 135 140 

Val He Gin Met Thr Asp Asp Glu Lys Tyr Leu Trp Lys Asp Leu Thr 
145 150 155 160 

Leu Asp Gin Ala Tyr Gly Asp Ala Val Glu Asn Ala Lys Asp He He 

165 170 175 

Ala Cys Gly Phe Asp He Asn Lys Thr Phe He Phe Ser Asp Leu Asp 

180 185 190 

Tyr Met Gly Met Ser Ser Gly Phe Tyr Lys Asn Val Val Lys He Gin 
195 200 205 
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Lys His Val Thr Phe Asn Gin Val Lys Gly He Phe Gly Phe Thr Asp 
210 215 220 

Ser Asp Cys He Gly Lys He Ser Phe Pro Ala He Gin Ala Ala Pro 
225 230 235 240 

Ser Phe Ser Asn Ser Phe Pro Gin He Phe Arg Asp Arg Thr Asp He 

245 250 255 

Gin Cys Leu He Pro Cys Ala He Asp Gin Asp Pro Tyr Phe Arg Met 

260 265 270 

Thr Arg Asp Val Ala Pro Arg He Gly Tyr Pro Lys Pro Ala Leu Leu 
275 280 285 

His Ser Thr Phe Phe Pro Ala Leu Gin Gly Ala Gin Thr Lys Met Ser 
290 295 300 

Ala Ser Asp Pro Asn Ser Ser He Phe Leu Thr Asp Thr Ala Lys Gin 
305 310 315 320 

He Lys Thr Lys Val Asn Lys His Ala Phe Ser Gly Gly Arg Asp Thr 

325 330 335 

He Glu Glu His Arg Gin Phe Gly Gly Asn Cys Asp Val Asp Val Ser 

340 345 350 

Phe Met Tyr Leu Thr Phe Phe Leu Glu Asp Asp Asp Lys Leu Glu Gin 
355 360 365 

He Arg Lys Asp Tyr Thr Ser Gly Ala Met Leu Thr Gly Glu Leu Lys 
370 375 380 

Lys Ala Leu He Glu Val Leu Gin Pro Leu He Ala Glu His Gin Ala 
385 390 395 400 

Arg Arg Lys Glu Val Thr Asp Glu He Val Lys Glu Phe Met Thr Pro 

405 410 415 

Arg Lys Leu Ser Phe Asp Phe Gin Lys Leu Ala Ala Ala Leu Glu His 

420 425 430 



His His His His His 
435 



<210> 13 
<211> 4811 
<212> DNA 

<213> Artificial Sequence 

<220> 
<221> CDS 

<222> (3428) . . (4672) 
<220> 

<223> Description of Artificial Sequence: human 
supermini TrpRS in pET20B 

<400> 13 

tggcgaatgg gacgcgccct gtagcggcgc attaagcgcg gcgggtgtgg tggttacgcg 



cagcgtgacc gctacacttg ccagcgccct 
ctttctcgcc acgttcgccg gctttccccg 
gttccgattt agtgctttac ggcacctcga 
acgtagtggg ccatcgccct gatagacggt 
ctttaatagt ggactcttgt tccaaactgg 
ttttgattta taagggattt tgccgatttc 
acaaaaattt aacgcgaatt ttaacaaaat 
tcggggaaat gtgcgcggaa cccctatttg 
tccgctcatg agacaataac cctgataaat 
gagtattcaa catttccgtg tcgcccttat 
ttttgctcac ccagaaacgc tggtgaaagt 
agtgggttac atcgaactgg atctcaacag 
agaacgtttt ccaatgatga gcacttttaa 
tattgacgcc gggcaagagc aactcggtcg 
tgagtactca ccagtcacag aaaagcatct 
cagtgctgcc ataaccatga gtgataacac 
aggaccgaag gagctaaccg cttttttgca 
tcgttgggaa ccggagctga atgaagccat 
tgcagcaatg gcaacaacgt tgcgcaaact 
ccggcaacaa ttaatagact ggatggaggc 
ggcccttccg gctggctggt ttattgctga 
cggtatcatt gcagcactgg ggccagatgg 
gacggggagt caggcaacta tggatgaacg 
actgattaag cattggtaac tgtcagacca 
aaaacttcat ttttaattta aaaggatcta 
caaaatccct taacgtgagt tttcgttcca 
aggatcttct tgagatcctt tttttctgcg 
accgctacca gcggtggttt gtttgccgga 
aactggcttc agcagagcgc agataccaaa 
ccaccacttc aagaactctg tagcaccgcc 
agtggctgct gccagtggcg ataagtcgtg 
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agcgcccgct cctttcgctt tcttcccttc 120 
tcaagctcta aatcgggggc tccctttagg 180 
ccccaaaaaa cttgattagg gtgatggttc 240 
ttttcgccct ttgacgttgg agtccacgtt 300 
aacaacactc aaccctatct cggtctattc 360 
ggcctattgg ttaaaaaatg agctgattta 420 
attaacgttt acaatttcag gtggcacttt 480 
tttatttttc taaatacatt caaatatgta 540 
gcttcaataa tattgaaaaa ggaagagtat 600 
tccctttttt gcggcatttt gccttcctgt 660 
aaaagatgct gaagatcagt tgggtgcacg 720 
cggtaagatc cttgagagtt ttcgccccga 780 
agttctgcta tgtggcgcgg tattatcccg 840 
ccgcatacac tattctcaga atgacttggt 900 
tacggatggc atgacagtaa gagaattatg 960 
tgcggccaac ttacttctga caacgatcgg 1020 
caacatgggg gatcatgtaa ctcgccttga 1080 
accaaacgac gagcgtgaca ccacgatgcc 1140 
attaactggc gaactactta ctctagcttc 1200 
ggataaagtt gcaggaccac ttctgcgctc 1260 
taaatctgga gccggtgagc gtgggtctcg 1320 
taagccctcc cgtatcgtag ttatctacac 1380 
aaatagacag atcgctgaga taggtgcctc 1440 
agtttactca tatatacttt agattgattt 1500 
ggtgaagatc ctttttgata atctcatgac 1560 
ctgagcgtca gaccccgtag aaaagatcaa 1620 
cgtaatctgc tgcttgcaaa caaaaaaacc 1680 
tcaagagcta ccaactcttt ttccgaaggt 1740 
tactgtcctt ctagtgtagc cgtagttagg 1800 
tacatacctc gctctgctaa tcctgttacc 18 60 
tcttaccggg ttggactcaa gacgatagtt 1920 
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accggataag gcgcagcggt cgggctgaac ggggggttcg tgcacacagc ccagcttgga 1980 
gcgaacgacc tacaccgaac tgagatacct acagcgtgag ctatgagaaa gcgccacgct 2040 
tcccgaaggg agaaaggcgg acaggtatcc ggtaagcggc agggtcggaa caggagagcg 2100 
cacgagggag cttccagggg gaaacgcctg gtatctttat agtcctgtcg ggtttcgcca 2160 
cctctgactt gagcgtcgat ttttgtgatg ctcgtcaggg gggcggagcc tatggaaaaa 2220 
cgccagcaac gcggcctttt tacggttcct ggccttttgc tggccttttg ctcacatgtt 2280 
ctttcctgcg ttatcccctg attctgtgga taaccgtatt accgcctttg agtgagctga 2340 
taccgctcgc cgcagccgaa cgaccgagcg cagcgagtca gtgagcgagg aagcggaaga 2400 
gcgcctgatg cggtattttc tccttacgca tctgtgcggt atttcacacc gcatatatgg 24 60 
tgcactctca gtacaatctg ctctgatgcc gcatagttaa gccagtatac actccgctat 2520 
cgctacgtga ctgggtcatg gctgcgcccc gacacccgcc aacacccgct gacgcgccct 2580 
gacgggcttg tctgctcccg gcatccgctt acagacaagc tgtgaccgtc tccgggagct 2640 
gcatgtgtca gaggttttca ccgtcatcac cgaaacgcgc gaggcagctg cggtaaagct 2700 
catcagcgtg gtcgtgaagc gattcacaga tgtctgcctg ttcatccgcg tccagctcgt 2760 
tgagtttctc cagaagcgtt aatgtctggc ttctgataaa gcgggccatg ttaagggcgg 2820 
ttttttcctg tttggtcact gatgcctccg tgtaaggggg atttctgttc atgggggtaa 2880 
tgataccgat gaaacgagag aggatgctca cgatacgggt tactgatgat gaacatgccc 2940 
ggttactgga acgttgtgag ggtaaacaac tggcggtatg gatgcggcgg gaccagagaa 3000 
aaatcactca gggtcaatgc cagcgcttcg ttaatacaga tgtaggtgtt ccacagggta 3060 
gccagcagca tcctgcgatg cagatccgga acataatggt gcagggcgct gacttccgcg 3120 
tttccagact ttacgaaaca cggaaaccga agaccattca tgttgttgct caggtcgcag 3180 
acgttttgca gcagcagtcg cttcacgttc gctcgcgtat cggtgattca ttctgctaac 3240 
cagtaaggca accccgccag cctagccggg tcctcaacga caggagcacg atcatgcgca 3300 
cccgtggcca ggacccaacg ctgcccgaga tctcgatccc gcgaaattaa tacgactcac 3360 
tatagggaga ccacaacggt ttccctctag aaataatttt gtttaacttt aagaaggaga 3420 



3469 



tatacat atg agt aat cat ggc cca gat gcc aca gaa get gaa gag gat 
Met Ser Asn His Gly Pro Asp Ala Thr Glu Ala Glu Glu Asp 
1 5 10 

ttt gtg gac cca tgg aca gta cag aca age agt gca aaa ggc ata gac 3517 
Phe Val Asp Pro Trp Thr Val Gin Thr Ser Ser Ala Lys Gly He Asp 
15 20 25 30 

tac gat aag etc att gtt egg ttt gga agt agt aaa att gac aaa gag 3565 
Tyr Asp Lys Leu He Val Arg Phe Gly Ser Ser Lys He Asp Lys Glu 

35 40 45 
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eta ata aac cga ata gag aga gec acc ggc caa aga cca cac cac ttc 3613 
Leu lie Asn Arg lie Glu Arg Ala Thr Gly Gin Arg Pro His His Phe 

50 55 60 

ctg cgc aga ggc ate ttc ttc tea cac aga gat atg aat cag gtt ctt 3661 
Leu Arg Arg Gly lie Phe Phe Ser His Arg Asp Met Asn Gin Val Leu 
65 70 75 

gat gee tat gaa aat aag aag cca ttt tat ctg tac acg ggc egg ggc 3709 
Asp Ala Tyr Glu Asn Lys Lys Pro Phe Tyr Leu Tyr Thr Gly Arg Gly 
80 85 90 

ccc tct tct gaa gca atg cat gta ggt cac etc att cca ttt att ttc 3757 
Pro Ser Ser Glu Ala Met His Val Gly His Leu He Pro Phe He Phe 
95 100 105 110 

aca aag tgg etc cag gat gta ttt aac gtg ccc ttg gtc ate cag atg 3805 
Thr Lys Trp Leu Gin Asp Val Phe Asn Val Pro Leu Val He Gin Met 

115 120 125 

acg gat gac gag aag tat ctg tgg aag gac ctg acc ctg gac cag gec 3853 
Thr Asp Asp Glu Lys Tyr Leu Trp Lys Asp Leu Thr Leu Asp Gin Ala 

130 135 140 

tat ggc gat get gtt gag aat gec aag gac ate ate gee tgt ggc ttt 3901 
Tyr Gly Asp Ala Val Glu Asn Ala Lys Asp He He Ala Cys Gly Phe 
145 150 155 

gac ate aac aag act ttc ata ttc tct gac ctg gac tac atg ggg atg 394 9 
Asp He Asn Lys Thr Phe He Phe Ser Asp Leu Asp Tyr Met Gly Met 
160 165 170 

age tea ggt ttc tac aaa aat gtg gtg aag att caa aag cat gtt acc 3997 
Ser Ser Gly Phe Tyr Lys Asn Val Val Lys He Gin Lys His Val Thr 
175 180 185 190 

ttc aac caa gtg aaa ggc att ttc ggc ttc act gac age gac tgc att 4045 
Phe Asn Gin Val Lys Gly He Phe Gly Phe Thr Asp Ser Asp Cys He 

195 200 205 

ggg aag ate agt ttt cct gee ate cag get get ccc tec ttc age aac 4093 
Gly Lys He Ser Phe Pro Ala He Gin Ala Ala Pro Ser Phe Ser Asn 

210 215 220 

tea ttc cca cag ate ttc cga gac agg acg gat ate cag tgc ctt ate 4141 
Ser Phe Pro Gin He Phe Arg Asp Arg Thr Asp He Gin Cys Leu He 
225 230 235 

cca tgt gee att gac cag gat cct tac ttt aga atg aca agg gac gtc 4189 
Pro Cys Ala He Asp Gin Asp Pro Tyr Phe Arg Met Thr Arg Asp Val 
240 245 250 

gee ccc agg ate ggc tat cct aaa cca gee ctg ttg cac tec acc ttc 4237 
Ala Pro Arg He Gly Tyr Pro Lys Pro Ala Leu Leu His Ser Thr Phe 
255 260 265 270 

ttc cca gec ctg cag ggc gec cag acc aaa atg agt gee age gac cca 4285 
Phe Pro Ala Leu Gin Gly Ala Gin Thr Lys Met Ser Ala Ser Asp Pro 

275 280 285 
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aac tec tec ate ttc etc ace gac acg gec aag cag ate aaa ace aag 4333 
Asn Ser Ser lie Phe Leu Thr Asp Thr Ala Lys Gin lie Lys Thr Lys 

290 295 300 

gtc aat aag cat gcg ttt tct gga ggg aga gac ace ate gag gag cac 4381 
Val Asn Lys His Ala Phe Ser Gly Gly Arg Asp Thr He Glu Glu His 
305 310 315 

agg cag ttt ggg ggc aac tgt gat gtg gac gtg tct ttc atg tac ctg 4429 
Arg Gin Phe Gly Gly Asn Cys Asp Val Asp Val Ser Phe Met Tyr Leu 
320 325 330 

ace ttc ttc etc gag gac gac gac aag etc gag cag ate agg aag gat 4477 
Thr Phe Phe Leu Glu Asp Asp Asp Lys Leu Glu Gin lie Arg Lys Asp 
335 340 345 350 

tac acc age gga gee atg etc acc ggt gag etc aag aag gca etc ata 4525 
Tyr Thr Ser Gly Ala Met Leu Thr Gly Glu Leu Lys Lys Ala Leu He 

355 360 365 

gag gtt ctg cag ccc ttg ate gca gag cac cag gee egg cgc aag gag 4573 
Glu Val Leu Gin Pro Leu He Ala Glu His Gin Ala Arg Arg Lys Glu 

370 375 380 

gtc acg gat gag ata gtg aaa gag ttc atg act ccc egg aag ctg tec 4 621 
Val Thr Asp Glu He Val Lys Glu Phe Met Thr Pro Arg Lys Leu Ser 
385 390 395 

ttc gac ttt cag aag ctt gcg gee gca etc gag cac cac cac cac cac 4 669 
Phe Asp Phe Gin Lys Leu Ala Ala Ala Leu Glu His His His His His 
400 405 410 

cac tgagatcegg ctgctaacaa ageccgaaag gaagctgagt tggctgctgc 4722 

His 

415 

caccgctgag caataactag cataacccct tggggectet aaaegggtet tgaggggttt 4782 
tttgctgaaa ggaggaacta tatceggat 4811 



<210> 14 
<211> 415 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: human 
supermini TrpRS in pET20B 

<400> 14 

Met Ser Asn His Gly Pro Asp Ala Thr Glu Ala Glu Glu Asp Phe Val 
15 10 15 

Asp Pro Trp Thr Val Gin Thr Ser Ser Ala Lys Gly He Asp Tyr Asp 

20 25 30 

Lys Leu He Val Arg Phe Gly Ser Ser Lys He Asp Lys Glu Leu He 
35 40 45 



Asn Arg He Glu Arg Ala Thr Gly Gin Arg Pro His His Phe Leu Arg 
50 55 60 
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Arg Gly He Phe Phe Ser His Arg Asp Met Asn Gin Val Leu Asp Ala 
65 70 75 80 

Tyr Glu Asn Lys Lys Pro Phe Tyr Leu Tyr Thr Gly Arg Gly Pro Ser 

85 90 95 

Ser Glu Ala Met His Val Gly His Leu He Pro Phe He Phe Thr Lys 

100 105 110 

Trp Leu Gin Asp Val Phe Asn Val Pro Leu Val He Gin Met Thr Asp 
115 120 125 

Asp Glu Lys Tyr Leu Trp Lys Asp Leu Thr Leu Asp Gin Ala Tyr Gly 
130 135 140 

Asp Ala Val Glu Asn Ala Lys Asp He He Ala Cys Gly Phe Asp He 
145 150 155 160 

Asn Lys Thr Phe He Phe Ser Asp Leu Asp Tyr Met Gly Met Ser Ser 

165 170 175 

Gly Phe Tyr Lys Asn Val Val Lys He Gin Lys His Val Thr Phe Asn 

180 185 190 

Gin Val Lys Gly He Phe Gly Phe Thr Asp Ser Asp Cys He Gly Lys 
195 200 205 

He Ser Phe Pro Ala He Gin Ala Ala Pro Ser Phe Ser Asn Ser Phe 
210 215 220 

Pro Gin He Phe Arg Asp Arg Thr Asp He Gin Cys Leu He Pro Cys 
225 230 235 240 

Ala He Asp Gin Asp Pro Tyr Phe Arg Met Thr Arg Asp Val Ala Pro 

245 250 255 

Arg He Gly Tyr Pro Lys Pro Ala Leu Leu His Ser Thr Phe Phe Pro 

260 265 270 

Ala Leu Gin Gly Ala Gin Thr Lys Met Ser Ala Ser Asp Pro Asn Ser 
275 280 285 

Ser He Phe Leu Thr Asp Thr Ala Lys Gin He Lys Thr Lys Val Asn 
290 295 300 

Lys His Ala Phe Ser Gly Gly Arg Asp Thr He Glu Glu His Arg Gin 
305 310 315 320 

Phe Gly Gly Asn Cys Asp Val Asp Val Ser Phe Met Tyr Leu Thr Phe 

325 330 335 

Phe Leu Glu Asp Asp Asp Lys Leu Glu Gin He Arg Lys Asp Tyr Thr 

340 345 350 

Ser Gly Ala Met Leu Thr Gly Glu Leu Lys Lys Ala Leu He Glu Val 
355 360 365 



Leu Gin Pro Leu He Ala Glu His Gin Ala Arg Arg Lys Glu Val Thr 
370 375 380 
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Asp Glu lie Val Lys Glu Phe Met Thr Pro Arg Lys Leu Ser Phe Asp 
385 390 395 400 

Phe Gin Lys Leu Ala Ala Ala Leu Glu His His His His His His 

405 410 415 

<210> 15 
<211> 4742 
<212> DNA 

<213> Artificial Sequence 

<220> 
<221> CDS 

<222> (3428) . . (4603) 
<220> 

<223> Description of Artificial Sequence: human minor 
Trp-RS fragment in pET20B 

<400> 15 

tggcgaatgg gacgcgccct gtagcggcgc attaagcgcg gcgggtgtgg tggttacgcg 60 
cagcgtgacc gctacacttg ccagcgccct agcgcccgct cctttcgctt tcttcccttc 120 
ctttctcgcc acgttcgccg gctttccccg tcaagctcta aatcgggggc tccctttagg 180 
gttccgattt agtgctttac ggcacctcga ccccaaaaaa cttgattagg gtgatggttc 24 0 
acgtagtggg ccatcgccct gatagacggt ttttcgccct ttgacgttgg agtccacgtt 300 
ctttaatagt ggactcttgt tccaaactgg aacaacactc aaccctatct cggtctattc 360 
ttttgattta taagggattt tgccgatttc ggcctattgg ttaaaaaatg agctgattta 420 
acaaaaattt aacgcgaatt ttaacaaaat attaacgttt acaatttcag gtggcacttt 480 
tcggggaaat gtgcgcggaa cccctatttg tttatttttc taaatacatt caaatatgta 540 
tccgctcatg agacaataac cctgataaat gcttcaataa tattgaaaaa ggaagagtat 600 
gagtattcaa catttccgtg tcgcccttat tccctttttt gcggcatttt gccttcctgt 660 
ttttgctcac ccagaaacgc tggtgaaagt aaaagatgct gaagatcagt tgggtgcacg 720 
agtgggttac atcgaactgg atctcaacag cggtaagatc cttgagagtt ttcgccccga 7 80 
agaacgtttt ccaatgatga gcacttttaa agttctgcta tgtggcgcgg tattatcccg 840 
tattgacgcc gggcaagagc aactcggtcg ccgcatacac tattctcaga atgacttggt 900 
tgagtactca ccagtcacag aaaagcatct tacggatggc atgacagtaa gagaattatg 960 
cagtgctgcc ataaccatga gtgataacac tgcggccaac ttacttctga caacgatcgg 1020 
aggaccgaag gagctaaccg cttttttgca caacatgggg gatcatgtaa ctcgccttga 1080 
tcgttgggaa ccggagctga atgaagccat accaaacgac gagcgtgaca ccacgatgcc 1140 
tgcagcaatg gcaacaacgt tgcgcaaact attaactggc gaactactta ctctagcttc 1200 
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ccggcaacaa ttaatagact ggatggaggc ggataaagtt gcaggaccac ttctgcgctc 1260 
ggcccttccg gctggctggt ttattgctga taaatctgga gccggtgagc gtgggtctcg 1320 
cggtatcatt gcagcactgg ggccagatgg taagccctcc cgtatcgtag ttatctacac 1380 
gacggggagt caggcaacta tggatgaacg aaatagacag atcgctgaga taggtgcctc 144 0 
actgattaag cattggtaac tgtcagacca agtttactca tatatacttt agattgattt 1500 
aaaacttcat ttttaattta aaaggatcta ggtgaagatc ctttttgata atctcatgac 1560 
caaaatccct taacgtgagt tttcgttcca ctgagcgtca gaccccgtag aaaagatcaa 1620 
aggatcttct tgagatcctt tttttctgcg cgtaatctgc tgcttgcaaa caaaaaaacc 1680 
accgctacca gcggtggttt gtttgccgga tcaagagcta ccaactcttt ttccgaaggt 1740 
aactggcttc agcagagcgc agataccaaa tactgtcctt ctagtgtagc cgtagttagg 1800 
ccaccacttc aagaactctg tagcaccgcc tacatacctc gctctgctaa tcctgttacc 18 60 
agtggctgct gccagtggcg ataagtcgtg tcttaccggg ttggactcaa gacgatagtt 1920 
accggataag gcgcagcggt cgggctgaac ggggggttcg tgcacacagc ccagcttgga 1980 
gcgaacgacc tacaccgaac tgagatacct acagcgtgag ctatgagaaa gcgccacgct 2040 
tcccgaaggg agaaaggcgg acaggtatcc ggtaagcggc agggtcggaa caggagagcg 2100 
cacgagggag cttccagggg gaaacgcctg gtatctttat agtcctgtcg ggtttcgcca 2160 
cctctgactt gagcgtcgat ttttgtgatg ctcgtcaggg gggcggagcc tatggaaaaa 2220 
cgccagcaac gcggcctttt tacggttcct ggccttttgc tggccttttg ctcacatgtt 2280 
ctttcctgcg ttatcccctg attctgtgga taaccgtatt accgcctttg agtgagctga 2340 
taccgctcgc cgcagccgaa cgaccgagcg cagcgagtca gtgagcgagg aagcggaaga 2400 
gcgcctgatg cggtattttc tccttacgca tctgtgcggt atttcacacc gcatatatgg 24 60 
tgcactctca gtacaatctg ctctgatgcc gcatagttaa gccagtatac actccgctat 2520 
cgctacgtga ctgggtcatg gctgcgcccc gacacccgcc aacacccgct gacgcgccct 2580 
gacgggcttg tctgctcccg gcatccgctt acagacaagc tgtgaccgtc tccgggagct 2640 
gcatgtgtca gaggttttca ccgtcatcac cgaaacgcgc gaggcagctg cggtaaagct 2700 
catcagcgtg gtcgtgaagc gattcacaga tgtctgcctg ttcatccgcg tccagctcgt 2760 
tgagtttctc cagaagcgtt aatgtctggc ttctgataaa gcgggccatg ttaagggcgg 2820 
ttttttcctg tttggtcact gatgcctccg tgtaaggggg atttctgttc atgggggtaa 2880 
tgataccgat gaaacgagag aggatgctca cgatacgggt tactgatgat gaacatgccc 2940 
ggttactgga acgttgtgag ggtaaacaac tggcggtatg gatgcggcgg gaccagagaa 3000 
aaatcactca gggtcaatgc cagcgcttcg ttaatacaga tgtaggtgtt ccacagggta 3060 
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gccagcagca tcctgcgatg cagatccgga acataatggt gcagggcgct gacttccgcg 3120 

tttccagact ttacgaaaca cggaaaccga agaccattca tgttgttgct caggtcgcag 3180 

acgttttgca gcagcagtcg cttcacgttc gctcgcgtat cggtgattca ttctgctaac 3240 

cagtaaggca accccgccag cctagccggg tcctcaacga caggagcacg atcatgcgca 3300 

cccgtggcca ggacccaacg ctgcccgaga tctcgatccc gcgaaattaa tacgactcac 3360 

tatagggaga ccacaacggt ttccctctag aaataatttt gtttaacttt aagaaggaga 3420 

tatacat atg agt gca aaa ggc ata gac tac gat aag etc att gtt egg 34 69 
Met Ser Ala Lys Gly lie Asp Tyr Asp Lys Leu lie Val Arg 
15 10 

ttt gga agt agt aaa att gac aaa gag eta ata aac cga ata gag aga 3517 
Phe Gly Ser Ser Lys lie Asp Lys Glu Leu lie Asn Arg lie Glu Arg 
15 20 25 30 

gec acc ggc caa aga cca cac cac ttc ctg cgc aga ggc ate ttc ttc 3565 
Ala Thr Gly Gin Arg Pro His His Phe Leu Arg Arg Gly lie Phe Phe 

35 40 45 

tea cac aga gat atg aat cag gtt ctt gat gee tat gaa aat aag aag 3613 
Ser His Arg Asp Met Asn Gin Val Leu Asp Ala Tyr Glu Asn Lys Lys 

50 55 60 

cca ttt tat ctg tac acg ggc egg ggc ccc tct tct gaa gca atg cat 3661 
Pro Phe Tyr Leu Tyr Thr Gly Arg Gly Pro Ser Ser Glu Ala Met His 

65 70 75 

gta ggt cac etc att cca ttt att ttc aca aag tgg etc cag gat gta 3709 
Val Gly His Leu He Pro Phe He Phe Thr Lys Trp Leu Gin Asp Val 
80 85 90 

ttt aac gtg ccc ttg gtc ate cag atg acg gat gac gag aag tat ctg 3757 
Phe Asn Val Pro Leu Val He Gin Met Thr Asp Asp Glu Lys Tyr Leu 
95 100 105 110 

tgg aag gac ctg acc ctg gac cag gee tat ggc gat get gtt gag aat 3805 
Trp Lys Asp Leu Thr Leu Asp Gin Ala Tyr Gly Asp Ala Val Glu Asn 

115 120 125 

gee aag gac ate ate gee tgt ggc ttt gac ate aac aag act ttc ata 3853 
Ala Lys Asp He He Ala Cys Gly Phe Asp He Asn Lys Thr Phe He 

130 135 140 

ttc tct gac ctg gac tac atg ggg atg age tea ggt ttc tac aaa aat 3901 
Phe Ser Asp Leu Asp Tyr Met Gly Met Ser Ser Gly Phe Tyr Lys Asn 
145 150 155 

gtg gtg aag att caa aag cat gtt acc ttc aac caa gtg aaa ggc att 3949 
Val Val Lys He Gin Lys His Val Thr Phe Asn Gin Val Lys Gly He 
160 165 170 

ttc ggc ttc act gac age gac tgc att ggg aag ate agt ttt cct gee 3997 
Phe Gly Phe Thr Asp Ser Asp Cys He Gly Lys He Ser Phe Pro Ala 
175 180 185 190 
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ate cag get get ccc tec ttc age aac tea ttc cca cag ate ttc cga 4045 
lie Gin Ala Ala Pro Ser Phe Ser Asn Ser Phe Pro Gin lie Phe Arg 

195 200 205 



gac agg acg gat ate cag tgc ctt ate cca tgt gee att gac cag gat 
Asp Arg Thr Asp lie Gin Cys Leu lie Pro Cys Ala lie Asp Gin Asp 

210 215 220 



4093 



cct tac ttt aga atg aca agg gac gtc gee ccc agg ate ggc tat cct 4141 
Pro Tyr Phe Arg Met Thr Arg Asp Val Ala Pro Arg lie Gly Tyr Pro 
225 230 235 

aaa cca gec ctg ttg cac tec acc ttc ttc cca gec ctg cag ggc gee 4189 
Lys Pro Ala Leu Leu His Ser Thr Phe Phe Pro Ala Leu Gin Gly Ala 
240 245 250 

cag acc aaa atg agt gee age gac cca aac tec tec ate ttc etc acc 4237 
Gin Thr Lys Met Ser Ala Ser Asp Pro Asn Ser Ser lie Phe Leu Thr 
255 260 265 270 

gac acg gec aag cag ate aaa acc aag gtc aat aag cat gcg ttt tct 4285 
Asp Thr Ala Lys Gin lie Lys Thr Lys Val Asn Lys His Ala Phe Ser 

275 280 285 

gga ggg aga gac acc ate gag gag cac agg cag ttt ggg ggc aac tgt 4 333 
Gly Gly Arg Asp Thr He Glu Glu His Arg Gin Phe Gly Gly Asn Cys 

290 295 300 

gat gtg gac gtg tct ttc atg tac ctg acc ttc ttc etc gag gac gac 4381 
Asp Val Asp Val Ser Phe Met Tyr Leu Thr Phe Phe Leu Glu Asp Asp 
305 310 315 

gac aag etc gag cag ate agg aag gat tac acc age gga gee atg etc 4429 
Asp Lys Leu Glu Gin He Arg Lys Asp Tyr Thr Ser Gly Ala Met Leu 
320 325 330 

acc ggt gag etc aag aag gca etc ata gag gtt ctg cag ccc ttg ate 4477 
Thr Gly Glu Leu Lys Lys Ala Leu He Glu Val Leu Gin Pro Leu He 
335 340 345 350 

gca gag cac cag gec egg cgc aag gag gtc acg gat gag ata gtg aaa 4525 
Ala Glu His Gin Ala Arg Arg Lys Glu Val Thr Asp Glu He Val Lys 

355 360 365 

gag ttc atg act ccc egg aag ctg tec ttc gac ttt cag aag ctt gcg 4573 
Glu Phe Met Thr Pro Arg Lys Leu Ser Phe Asp Phe Gin Lys Leu Ala 

370 375 380 

gee gca etc gag cac cac cac cac cac cac tgagatcegg ctgctaacaa 4 623 
Ala Ala Leu Glu His His His His His His 
385 390 

ageccgaaag gaagctgagt tggctgctgc caccgctgag caataactag cataacccct 4683 

tggggectet aaaegggtet tgaggggttt tttgctgaaa ggaggaacta tatceggat 4742 
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<210> 16 
<211> 392 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: human minor 
TrpRS fragment in pET20B 

<400> 16 

Met Ser Ala Lys Gly lie Asp Tyr Asp Lys Leu lie Val Arg Phe Gly 
15 10 15 

Ser Ser Lys lie Asp Lys Glu Leu lie Asn Arg lie Glu Arg Ala Thr 

20 25 30 

Gly Gin Arg Pro His His Phe Leu Arg Arg Gly lie Phe Phe Ser His 
35 40 45 

Arg Asp Met Asn Gin Val Leu Asp Ala Tyr Glu Asn Lys Lys Pro Phe 
50 55 60 

Tyr Leu Tyr Thr Gly Arg Gly Pro Ser Ser Glu Ala Met His Val Gly 
65 70 75 80 

His Leu lie Pro Phe lie Phe Thr Lys Trp Leu Gin Asp Val Phe Asn 

85 90 95 

Val Pro Leu Val lie Gin Met Thr Asp Asp Glu Lys Tyr Leu Trp Lys 

100 105 110 

Asp Leu Thr Leu Asp Gin Ala Tyr Gly Asp Ala Val Glu Asn Ala Lys 
115 120 125 

Asp lie lie Ala Cys Gly Phe Asp lie Asn Lys Thr Phe lie Phe Ser 
130 135 140 

Asp Leu Asp Tyr Met Gly Met Ser Ser Gly Phe Tyr Lys Asn Val Val 
145 150 155 160 

Lys He Gin Lys His Val Thr Phe Asn Gin Val Lys Gly He Phe Gly 

165 170 175 

Phe Thr Asp Ser Asp Cys He Gly Lys He Ser Phe Pro Ala He Gin 

180 185 190 

Ala Ala Pro Ser Phe Ser Asn Ser Phe Pro Gin He Phe Arg Asp Arg 
195 200 205 

Thr Asp He Gin Cys Leu He Pro Cys Ala He Asp Gin Asp Pro Tyr 
210 215 220 

Phe Arg Met Thr Arg Asp Val Ala Pro Arg He Gly Tyr Pro Lys Pro 
225 230 235 240 

Ala Leu Leu His Ser Thr Phe Phe Pro Ala Leu Gin Gly Ala Gin Thr 

245 250 255 



Lys Met Ser Ala Ser Asp Pro Asn Ser Ser He Phe Leu Thr Asp Thr 

260 265 270 
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Ala Lys Gin lie Lys Thr Lys Val Asn Lys His Ala Phe Ser Gly Gly 
275 280 285 

Arg Asp Thr lie Glu Glu His Arg Gin Phe Gly Gly Asn Cys Asp Val 
290 295 300 

Asp Val Ser Phe Met Tyr Leu Thr Phe Phe Leu Glu Asp Asp Asp Lys 
305 310 315 320 

Leu Glu Gin He Arg Lys Asp Tyr Thr Ser Gly Ala Met Leu Thr Gly 

325 330 335 

Glu Leu Lys Lys Ala Leu He Glu Val Leu Gin Pro Leu He Ala Glu 

340 345 350 

His Gin Ala Arg Arg Lys Glu Val Thr Asp Glu He Val Lys Glu Phe 
355 360 365 

Met Thr Pro Arg Lys Leu Ser Phe Asp Phe Gin Lys Leu Ala Ala Ala 
370 375 380 



Leu Glu His His His His His His 
385 390 



<210> 17 
<211> 6 
<212> PRT 

<213> Homo sapiens 
<400> 17 

Glu Leu Arg Val Ser Tyr 
1 5 



<210> 18 
<211> 6 
<212> PRT 

<213> Escherichia coli 
<400> 18 

Glu Thr Val Gin Glu Trp 
1 5 



<210> 19 
<211> 9 
<212> PRT 

<213> Homo sapiens 
<400> 19 

Ser Ala Lys Glu Leu Arg Cys Gin Cys 
1 5 
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<210> 20 

<211> 11 

<212> PRT 

<213> Homo sapiens 

<400> 20 

Ala Ser Val Ala Thr Glu Leu Arg Cys Gin Cys 
15 10 



<210> 21 
<211> 7 
<212> PRT 

<213> Homo sapiens 
<400> 21 

Ala Glu Leu Arg Cys Gin Cys 
1 5 



<210> 22 
<211> 58 
<212> PRT 

<213> Homo sapiens 
<400> 22 

Gly Asp Glu Lys Lys Ala Lys Glu Lys lie Glu Lys Lys Gly Glu Lys 
15 10 15 

Lys Glu Lys Lys Gin Gin Ser lie Ala Gly Ser Ala Asp Ser Lys Pro 

20 25 30 

lie Asp Val Ser Arg Leu Asp Leu Arg lie Gly Cys lie lie Thr Ala 
35 40 45 

Arg Lys His Pro Asp Ala Asp Ser Leu Tyr 
50 55 



<210> 23 

<211> 58 
<212> PRT 

<213> Homo sapiens 
<400> 23 

Pro Ala Leu Lys Lys Leu Ala Ser Ala Ala Tyr Pro Asp Pro Ser Lys 
15 10 15 

Gin Lys Pro Met Ala Lys Gly Pro Ala Lys Asn Ser Glu Pro Glu Glu 

20 25 30 

Val lie Pro Ser Arg Leu Asp lie Arg Val Gly Lys lie lie Thr Val 

35 40 45 

Glu Lys His Pro Asp Ala Asp Ser Leu Tyr 
50 55 



<210> 24 
<211> 7 
<212> PRT 

<213> Homo sapiens 
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<400> 24 

Arg Val Gly Lys lie lie Thr 
1 5 



<210> 25 
<211> 7 
<212> PRT 

<213> Homo sapiens 
<400> 25 

Arg lie Gly Cys lie lie Thr 
1 5 



<210> 26 
<211> 7 
<212> PRT 

<213> Homo sapiens 
<400> 26 

Arg lie Gly Arg lie lie Thr 
1 5 



<210> 27 
<211> 7 
<212> PRT 

<213> Caenorhabditis elegans 
<400> 27 

Arg Val Gly Arg lie lie Lys 
1 5 



<210> 28 
<211> 7 
<212> PRT 

<213> Saccharomyces cerevisiae 
<400> 28 

Arg Val Gly Phe He Gin Lys 
1 5 



<210> 29 
<211> 7 
<212> PRT 
<213> Bos taurus 

<400> 29 

Arg Val Gly Lys Val He Ser 
1 5 



<210> 30 
<211> 7 
<212> PRT 

<213> Mus musculus 



<400> 30 
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Arg He Gly Cys He Val Thr 
1 5 



<210> 31 
<211> 7 
<212> PRT 

<213> Mesocricetus auratus 
<400> 31 

Arg He Gly Arg He Val Thr 
1 5 



<210> 32 

<211> 7 

<212> PRT 

<213> Ovis aries 

<400> 32 

Arg He Gly Cys He He Thr 
1 5 



<210> 33 

<211> 7 

<212> PRT 

<213> Calcarea sp. 

<400> 33 

Arg He Gly Arg He Thr Ser 
1 5 



<210> 34 

<211> 7 

<212> PRT 

<213> A. aeolicus 

<400> 34 

Arg Val Ala Lys Val Leu Ser 
1 5 



<210> 35 
<211> 7 
<212> PRT 

<213> Escherichia coli 
<400> 35 

Arg Val Gly Lys He Val Glu 
1 5 



<210> 36 
<211> 7 
<212> PRT 

<213> Escherichia coli 



<400> 36 

Arg Val Ala Leu He Glu Asn 
1 5 
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<210> 37 
<211> 7 
<212> PRT 

<213> Haemophilus influenzae 
<400> 37 

Arg Val Ala Lys Val Leu Lys 
1 5 



<210> 38 
<211> 7 
<212> PRT 

<213> Bacillus subtilis 
<400> 38 

Arg Val Ala Glu Val He Glu 
1 5 



<210> 39 
<211> 7 
<212> PRT 

<213> B. stearothermophilus 
<400> 39 

Arg Val Ala Glu Val Val Gin 
1 5 



<210> 40 
<211> 7 
<212> PRT 

<213> Thermus thermophilus 
<400> 40 

Arg Val Ala Glu Val Leu Ala 
1 5 



<210> 41 
<211> 6 
<212> PRT 

<213> Escherichia coli 
<400> 41 

Val Gly Glu Val Val Glu 
1 5 



<210> 42 
<211> 6 
<212> PRT 

<213> Bacillus subtilis 



<400> 42 

He Gly His Val Leu Glu 
1 5 
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<210> 43 
<211> 6 
<212> PRT 

<213> Synechococcus sp. 
<400> 43 

Val Gly Arg Val Leu Glu 
1 5 



<210> 44 
<211> 6 
<212> PRT 

<213> Thermus thermophilus 
<400> 44 

Phe Ala Arg Val Leu Glu 
1 5 



<210> 45 

<211> 85 

<212> PRT 

<213> Homo sapiens 

<400> 45 

Met Ser Tyr Lys Ala Ala Ala Gly Glu Asp Tyr Lys Ala Asp Cys Pro 
15 10 15 

Pro Gly Asn Pro Ala Pro Thr Ser Asn His Gly Pro Asp Ala Thr Glu 

20 25 30 

Ala Glu Glu Asp Phe Val Asp Pro Trp Thr Val Gin Thr Ser Ser Ala 
35 40 45 

Lys Gly He Asp Tyr Asp Lys Leu He Val Arg Phe Gly Ser Ser Lys 
50 55 60 

He Asp Lys Glu Leu He Asn Arg He Glu Arg Ala Thr Gly Gin Arg 
65 70 75 80 

Pro His His Phe Leu 

85 



<210> 46 
<211> 85 
<212> PRT 
<213> Bos taurus 

<400> 46 

Thr Ser Tyr Lys Ala Ala Thr Gly Glu Asp Tyr Lys Val Asp Cys Pro 
15 10 15 

Pro Gly Asp Pro Ala Pro Glu Ser Gly Glu Gly Leu Asp Ala Thr Glu 

20 25 30 

Ala Asp Glu Asp Phe Val Asp Pro Trp Thr Val Gin Thr Ser Ser Ala 

35 40 45 

Lys Gly He Asp Tyr Asp Lys Leu He Val Arg Phe Gly Ser Ser Lys 
50 55 60 
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He Asp Lys Glu Leu Val Asn Arg He Glu Arg Ala Thr Gly Gin Arg 
65 70 75 80 

Pro His Arg Phe Leu 

85 



<210> 47 
<211> 85 
<212> PRT 

<213> Mus musculus 
<400> 47 

Met Ser Tyr Lys Ala Ala Met Gly Glu Glu Tyr Lys Ala Gly Cys Pro 
15 10 15 

Pro Gly Asn Pro Thr Ala Gly Arg Asn Cys Asp Ser Asp Ala Thr Lys 

20 25 30 

Ala Ser Glu Asp Phe Val Asp Pro Trp Thr Val Arg Thr Ser Ser Ala 
35 40 45 

Lys Gly He Asp Tyr Asp Lys Leu He Val Gin Pro Gly Ser Ser Lys 
50 55 60 

He Asp Lys Glu Leu He Asn Arg He Glu Arg Ala Thr Gly Gin Arg 
65 70 75 80 

Pro His Arg Phe Leu 

85 



<210> 48 
<211> 85 
<212> PRT 

<213> Oryctolagus cuniculus 
<400> 48 

Thr Ser Tyr Lys Glu Ala Met Gly Glu Asp Tyr Lys Ala Asp Cys Pro 
15 10 15 

Pro Gly Asn Ser Thr Pro Asp Ser His Gly Pro Asp Glu Ala Val Asp 

20 25 30 

Asp Lys Glu Asp Phe Val Asp Pro Trp Thr Val Arg Thr Ser Ser Ala 
35 40 45 

Lys Gly He Asp Tyr Asp Lys Leu He Val Gin Phe Gly Ser Ser Lys 
50 55 60 

He Asp Lys Glu Leu Val Asn Arg He Glu Arg Ala Thr Gly Gin Arg 
65 70 75 80 



Pro His Arg Phe Leu 

85 
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<210> 49 

<211> 86 
<212> PRT 

<213> Homo sapiens 
<400> 49 

lie Ser Tyr Gin Gly Arg lie Pro Tyr Pro Arg Pro Gly Thr Cys Pro 
15 10 15 

Gly Gly Ala Phe Thr Pro Asn Met Arg Thr Thr Lys Glu Phe Pro Asp 

20 25 30 

Asp Val Val Thr Phe lie Arg Asn His Pro Leu Met Tyr Asn Ser lie 

35 40 45 

Tyr Pro lie His Lys Arg Pro Leu lie Val Arg lie Gly Thr Asp Tyr 
50 55 60 

Lys Tyr Thr Lys lie Ala Val Asp Arg Val Asn Ala Ala Asp Gly Arg 
65 70 75 80 

Tyr His Val Leu Phe Leu 

85 



<210> 50 
<211> 86 
<212> PRT 

<213> Mus musculus 
<400> 50 

lie Ser Tyr Gin Gly Arg lie Pro Tyr Pro Arg Pro Gly Thr Cys Pro 
15 10 15 

Gly Gly Ala Phe Thr Pro Asn Met Arg Thr Thr Lys Asp Phe Pro Asp 

20 25 30 

Asp Val Val Thr Phe lie Arg Asn His Pro Leu Met Tyr Asn Ser lie 
35 40 45 

Ser Pro lie His Arg Arg Pro Leu lie Val Arg lie Gly Thr Asp Tyr 
50 55 60 

Lys Tyr Thr Lys lie Ala Val Asp Arg Val Asn Ala Ala Asp Gly Arg 
65 70 75 80 



Tyr His Val Leu Phe Leu 

85 



<210> 51 
<211> 46 
<212> PRT 

<213> Homo sapiens 
<400> 51 

Ala Ala Ala Gly Glu Asp Tyr Lys Ala Asp Cys Pro Pro Gly Asn Pro 
15 10 15 

Ala Pro Thr Ser Asn His Gly Pro Asp Ala Thr Glu Ala Glu Glu Asp 

20 25 30 
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Phe Val Asp Pro Trp Thr Val Gin Thr Ser Ser Ala Lys Gly 
35 40 45 



<210> 52 
<211> 46 
<212> PRT 
<213> Bos taurus 

<400> 52 

Ala Ala Thr Gly Glu Asp Tyr Lys Val Asp Cys Pro Pro Gly Asp Pro 
15 10 15 

Ala Pro Glu Ser Gly Glu Gly Leu Asp Ala Thr Glu Ala Asp Glu Asp 

20 25 30 

Phe Val Asp Pro Trp Thr Val Gin Thr Ser Ser Ala Lys Gly 
35 40 45 



<210> 53 
<211> 46 
<212> PRT 

<213> Mus musculus 
<400> 53 

Ala Ala Met Gly Glu Glu Tyr Lys Ala Gly Cys Pro Pro Gly Asn Pro 
15 10 15 

Thr Ala Gly Arg Asn Cys Asp Ser Asp Ala Thr Lys Ala Ser Glu Asp 

20 25 30 

Phe Val Asp Pro Trp Thr Val Arg Thr Ser Ser Ala Lys Gly 
35 40 45 



<210> 54 
<211> 46 
<212> PRT 

<213> Oryctolagus cuniculus 
<400> 54 

Glu Ala Met Gly Glu Asp Tyr Lys Ala Asp Cys Pro Pro Gly Asn Ser 
15 10 15 

Thr Pro Asp Ser His Gly Pro Asp Glu Ala Val Asp Asp Lys Glu Asp 

20 25 30 

Phe Val Asp Pro Trp Thr Val Arg Thr Ser Ser Ala Lys Gly 
35 40 45 



<210> 55 
<211> 41 
<212> PRT 

<213> Mus musculus 
<400> 55 

Ala Phe Ala Gly Glu Asp Phe Lys Val Asp lie Pro Glu Thr His Gly 
15 10 15 
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Gly Glu Gly Thr Glu Asp Glu He Asp Asp Glu Tyr Glu Gly Asp Trp 

20 25 30 

Ser Asn Ser Ser Ser Ser Thr Ser Gly 
35 40 



<210> 56 
<211> 5 
<212> PRT 

<213> Homo sapiens 
<400> 56 

Met Gly Asp Ala Pro 
1 5 



<210> 57 
<211> 5 
<212> PRT 

<213> Homo sapiens 
<400> 57 

Ser Asn His Gly Pro 
1 5 



<210> 58 

<211> 5 

<212> PRT 

<213> Homo sapiens 

<400> 58 

Ser Ala Lys Gly He 
1 5 



1 



1 



